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TITIJE OF TBQB INVENTION 

ASSAYS TO MONITOR AMYLOID PRECURSOR PROTEIN PROCESSING 

CROSS-REFERENCE TO RELATED APPUCATIONS 
5 This application claims the benefit of U.S, Provisional Application No. 

60/360,274, filed Febmaiy 27, 2002, the contents of which are incorporated herein by 
reference in their entirety. 

STATEMENT REGARDING FEDERADLY-SPONSORED R&D 
10 Not applicable. 

REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

15 FIELD OF IBE INVENTION 

The present invention is directed to the field of Alzheimer's disease. 
In particular, the present invention provides novel methods of identifying substances 
that are specific inhibitors of various steps in the processing of amyloid precursor 
protein. 

20 

BACKGROUND OF THE INVENTION 

Alzheimer's disease is a common, chronic neurodegenerative disease, 
characterized by a progressive loss of memory and sometimes severe behavioral 
abnormalities, as well as an impainnent of other cognitive functions that often leads to 

25 dementia and death. It ranks as the fourth leading cause of death in industrialized 

societies after heart disease, cancer, and stroke. The incidence of Alzheimer's disease 
is high, with an estimated 2.5 to 4 million patients affected in the United States and 
peifaaps 17 to 25 million worldwide. Moreover, the number of sufferers is expected to 
grow as the population ages. 

30 A characteristic feature of Alzheimer's disease is the presence of large 

numbers of insoluble deposits, known as amyloid plaques, in the brains of those 
affected. Autopsies have shown that amyloid plaques are found in the brains of 
virtually all Alzheimer's patients and that the degree of amyloid plaque deposition 
correlates with the degree of dementia (Cummings & Cotman, 1995, Lancet 
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326: 1524-1587). While some opinion holds that amyloid plaques are a late stage by- 
product of the disease process, the consensus view is that amyloid plaques are more 
likely to be intimately, and perhaps causally, involved in Alzheimer's disease. 

A variety of experimental evidence supports this view. For example, 
5 Ap, a primary component of amyloid plaques, is toxic to neurons in culture and 

transgenic mice that overproduce Ap in their brains show significant deposition of Ap 
into amyloid plaques as well as significant neuronal toxicity (Yankner, 1990, Science 
250:279-282; Mattson et al., 1992, J. Neurosci. 12:379-389; Games et al., 1995, 
Nature 373:523-527; LaFerla et al., 1995, Nature Genetics 9:21-29), Mutations in the 
10 APP gene, leading to increased Ap production, have been linked to heritable forms of 
Alzheuner's disease (Goate et al., 1991, Nature 349:704-706; Chartier-Harlan et al., 

1991, Nature 353:844-846; Munel et al., 1991,Science 254:97-99; MuUan et al., 

1992, Nature Genetics 1:345-347). Presenilin-1 (PSl) and preseniIin-2 ( PS2) related 
familial early-onset Alzheimer's disease (PAD) shows disproportionately increased 

15 production of Apl-42, the 42 amino acid isofonn of AP, as opposed to Api-40, the 
40 amino acid isoform (Scheuner et al, 1996, Nature Medicine 2:864-870). The 
longer isoform of AP is more prone to aggregation than the shorter isofonn (Jarrett et 
al, 1993, Biochemistry 32:4693-4697). Ejection of the insoluble, fibrillar form of Ap 
into monkey brains results in the development of pathology (neuronal destruction, tau 

20 phosphorylation, microglial proliferation) that closely mimics Alzheimer's disease in 
humans (Geula et al., 1998, Nature Medicine 4:827-831). See Selkoe, 1994, J. 
Neuropathol. Exp. Neurol. 53:438-447 for a review of the evidence that amyloid 
plaques have a central role in Alzheimer's disease. 

AP, a 39-43 amino add peptide derived by proteolytic cleavage of the 

25 amyloid precursor protein (APP), is the major component of amyloid plaques 
(Glenner & Wong, 1984, Biochem. Biophys. Res. Comm. 120:885-890). APP is 
actually a family of polypeptides produced by alternative splicing from a single gene. 
Major forms of APP are known as APP695. APP751, and APP770, with the 

subscripts referring to the number of amino acids in each splice variant (Ponte et al., 
30 1988, Nature 331:525-527; Tanzi et al., 1988, Nature 331:528-530; Kitaguchi et al., 
1988, Nature 331:530-532). APP is membrane bound and undergoes proteolytic 
cleavage by at least two pathways. In one pathway, cleavage by an enzyme known as 
a-secretase occurs while APP is still in the trans-Golgi secretory compartment 
(Kuentzel et al., 1993, Biochem. J. 295:367-378). This cleavage by a-secretase 
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occurs within the portion of APP, thus precluding the formation of Ap. In another 
proteolytic pathway, cleavage of the Met596-Asp597 bond (numbered according to 

the 695 amino acid protein) by an enzyme known as P-secretase occurs. This 
cleavage by p-secretase generates the N-tenninus of Ap. The C-tenninus is formed 
5 by cleavage by a second enzyme known as y-secretase. The C-terminus is actually a 
heterogeneous collection of cleavage sites rather than a single site since y-secretase 
activity occurs over a short stretch of APP amino acids rather than at a single peptide 
bond. Peptides of 40 or 42 amino acids in length (Apl-40 and Apl-42, respectively) 
predominate among the C-termini generated by y-secretase. APl-42 is more prone to 

10 aggregation than AP1--40, is the major component of amyloid plaque (Janett et al., 
1993, Biochemistry 32:4693-4697; Kuo et al., 1996, J. Biol. Chem. 271:4077-4081), 
and its production is closely associated with the development of Alzheimer's disease 
(Sinha & lieberburg, 1999, Proc. Natl. Acad. Sci. USA 96:11049-11053). The bond 
cleaved by y-secretase appears to be situated within the transmembrane domain of 

15 APP. It is unclear as to whether the C-termini of Apl-40 and AP1--42 are generated 
by a single y-secietase protease with sloppy specificity or by two distinct proteases. 
For a review that discusses APP and its processing, see Selkoe, 1998, Trends Cell. 
Biol. 8:447^53, 

Much interest has focused on the possibility of inhibiting the 

20 development of amyloid plaques as a means of preventing or ameliorating the 

symptoms of Alzheimer's disease. To that end, a promising strategy is to inhibit the 
activity of p- and y-secretase, the two enzymes that together are responsible for 
producing Ap. This strategy is attractive because, if the formation of amyloid plaques 
as a result of the deposition of AP is a cause of Alzheimer's disease, inhibiting the 

25 activity of one or both of the two secretases would intervene in the disease process at 
an early stage, before late-stage events such as inflammation or apoptosis occur. Such 
early stage intervention is expected to be particularly beneficial (see, e.g.. Citron, 
2000, Molecular Medicine Today 6:392-397). 

To that end, various assays have been developed that are directed to the 

30 identification of compounds that may interfere with the production of AP or its 

deposition into amyloid plaques. U.S. Patent No. 5,441,870 is directed to methods of 
monitoring the processing of APP by detecting the production of amino terminal 
firagments of APP. U.S. Patent No. 5,605,811 is directed to methods of identifying 
inhibitors of the production of amino terminal jBragments of APP. U.S. Patent No. 
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5,593,846 is diiected to methods of detecting soluble by the use of binding 
substances such as antibodies. Esler et al., 1997, Nature Biotechnology 15:258-263 
described an assay that monitored the deposition of AP from solution onto a synthetic 
analogue of an amyloid plaque. The assay was suitable for identifying compounds 
5 that could inhibit the deposition of Ap. However, this assay is not suitable for 

identifying substances, such as inhibitors of P- or y-secretase, that would prevent the 
formation of Ap. 

Various groups have cloned and sequenced cDNA encoding a protein 
that is believed to be P-secretase (Vassar et al., 1999, Science 286:735-741; Hussain 

10 et al., 1999, MoL Cell. Neurosci. 14:419-427; Yan et al., 1999, Nature 402:533-537; 
Sinha et al., 1999, Nature 402:537-540; Lin et al., 2000, Proc. Natl, Acad. Sci. USA 
97: 1456-1460) but the identity of y-secretase has been more elusive. A pair of 
protdns known as presenilin-l and presenilin-2 are viewed as possible candidates 
(Selkoe & Wolfe, 2000, Proc. Natl. Acad. Sci. USA 97:5690-5692). 

15 Prcsenilin-l (PSl) and presenilin-2 (PS2) are polytopic membrane 

proteins that are involved in y-secretase-mediated processing of AFP. The most 
conunon cause of familial early-onset Alzheimer's disease is the autosomal dominant 
inheritance of assorted mutations in the PSl gene (Sherrington et al., 1995, Nature 
375:754-760). These PSl mutations lead to increased production of Apl-42 

20 (Scheuner et al., 1996, Nature Medicine 2:864^70; Duff et al., 1996, Nature 
383:710-713; Borchelt et al., 1996, Neuron 17:1005-1013). Similarly, certain 
mutations in PS2 cause familial early-onset Alzheimer's disease and increased 
generation of AP42 (Levy-Lahad et al., 1995, Science 269:970-973). Cultured 
isolated neurons from PSl-defident mice exhibit reduced y-secretase-mediated 

25 cleava^ of APP (De Strooper et al., 1998, Nature 391:387-390). It was suggested 
that PSl might mfluence traffickmg of APP and/or y-secretase or it mi^t play a more 
direct role in proteolytic cleavage of APP. Directed mutagenesis of two conserved 
transmembrane-situated aspartates in PSl was shown to inactivate y-secretase activity 
m cellular assays, suggesting that PSl is either a required diaspartyl cofactor for y- 

30 secretase or is itself y-secretase (Wolfe et al., 1999, Nature 398:513-517). Moreover, 
li et al., 2000, Nature 405:689-694 made photoactivatable derivatives of a highly 
specific and potent aspartyl protease transition state analog inhibitor and found that 
the inhibitor selectively labeled presenilin fragments. 



-4- 



wo 03/072041 



PCT/US03/05458 



Co-immunopiecipitation experiments have shown that PSl and PS2 
interact directly with the immature forms of APP in the endoplasmic reticulum where 
the disease-associated amyloid Api-42 peptide is probably generated pfla et al., 1997 
Proc. Nafl. Acad. Sci. USA 94:8208-8213; Weidemann et al., 1997, Nat. Med. 3:328- 
5 332). Knock-out of PS 1 activity greatly diminishes y-secretase cleavage of APP (De 
Strooper et al., 1998, Nature 391:387-390). PSl knock-outs do not exhibit total lack 
of Y-secretase activity but knock-out of both PSl and PS2 activity does result in a total 
loss of Y-secretase activity (Herreman et al., 2000, Nat. Cell. Biol. 2:461-462; Zhang 
et al., 2000, Nat. Cell Biol 2:463-465), suggesting that PS2 has a similar function to 

10 PSl in the piwessing of APP. 

KarlstrOm et al., (Journal of Biological Chemistry papers in press, 
published on December 13, 2001 as Manuscript C100649200) describes an assay 
designed specifically to identify inhibitors of Y-secretase cleavage of APP. The 
authors ins^ed the GAI4 DNA binding domain fused to the VP16 tiansactivation 

15 domain into C99, a portion of APP containing the 99 caiboxy-terminal amino acids. 
This fragment of APP contains the Y-secretase cleavage site but lacks the p-secretase 
cleavage site. Transaction of a UAS reporter plasmid by GAL4-VP16 confirmed 
cleavage of the Gal4-VP16/C99 substrate by Y-secretase only. Thus, the assay is 
capable of detecting Y-secretase inhibitors but not inhibitors of p-secretase or other 

20 modulators of APP processing requiring the N-tenninal domaiu of APP. 

Cao & SUdhoff, 2001, Science 293:115-120 described work in which 
the GAL4 and LexA DNA binding domains were inserted into APP to demonstrate 
the potential of the cleaved C-terminus of APP for transcriptional co-activation. In 
this article, a transcriptional factor was not fused to APP and no attempt was made to 

25 develop an assay for the identification of APP processing inhibitors. 

Sisodia, 1992, Proc. Natl. Acad. Sci. USA 89:6975-6979 described 
various changes in the amino acid sequence of APP in ttie region of the a-secretase 
cleavage site and die effect of those changes on cleavage by oc-secretase. A change of 
K to V at position 612 of the 695 amino acid version of APP led to reduced cleavage 

30 by a-secretase. 

U.S. Patent No. 6,333,167 Bl discloses an assay involving DNA 
constructs encoding portions of membrane proteins containing sites that are 
susceptible to cleavage by proteases that are fused to transcriptional repressors. Such 
constructs are introduced into cells that contain a reporter gene under the control of a 
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promote that is sensitive to the lepiessor. In the absence of an inhibitor of the 
protease, the fusion protein is cleaved by the protease, releasing a membrane 
protem/iepressor fusion protein that translocates to the nucleus and represses 
transcription from the reporter gene. In the presence of an inhibitor of the protease, 
5 the membrane protein/repressor fusion protein is not released and thus cannot repress 
transcription from the reporter. An increase in reporter expression can therefore be 
used as a readout for the presence of an inhibitor. 

SUMMARY OF THE INVENTION 

10 The present invention is directed to methods of identifying inhibitors of 

the processing of amyloid preciirsor protein (APP) that are capable of identifying 
inhibitors of a number of steps of such processing. Unlite prior methods, the methods 
of the present invention can be used to screen for inhibitors of P-secretase cleavage, y- 
secretase cleavage, APP extracellular signaling, or APP cytoplasmic signaling in a 

15 single assay. 

The methods employ a recombinant eukaryotic cell that is capable of 
processing APP. The cell has been engineered to express a fusion protein that 
contains amino acid sequences encompassing both the P-secretase cleavage site of 
APP and the y-secretase cleavage site. The fusion protein also contains a transcription 

20 factor fused in frame to the APP sequences. 

When the recombinant cell is further engineered to contain a reporter 
gene, in which transcription of the reporter gene is driven by a regulatory DNA 
sequence that is inactive in the absence of the transcription factor but active in the 
transcription factor's presence, a system useful for screening for APP processing 

25 inhibitors is provided. Since the recombinant cell has been selected so as to be 
capable of processing APP, the fusion protein will be processed, releasing the 
transcription factor and activating transcription of the reporter gene. The reporter 
gene has been preselected so that activation of the reporter gene leads to a detectable 
phenotype. 

30 The system is utilized by exposing the recombinant cell to substances 

that are to be tested for the ability to inhibit APP processing. Those substances that 
are actually inhibitors of APP processing will cause diminished processing of the 
fusion protein, leading to smaller amounts of the transcription factor being released. 
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This leads to less transcription of the reporter gene. This results in a decrease in the 
phenotypic effect of the reporter gene that can be observed. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 A-G shows a schematic diagram of several APP/transciiption 

factor fusion constructs. 

Figure 2A-B shows the DNA sequence (SEQ ID NO: 1) of the fusion 
protein APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695). 

Figure 3 shows the amino acid sequence (SEQ ID N0:2) of the fusion 
10 protein APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
cleavage; 5 = linker; 6 = TAT exon I; 7 = linker, 8 = amino acids 664-695 of APP. 

Figure 4A-B shows the DNA sequence (SEQ ID N0:3) of the fusion 
15 protein APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695). 

Figure 5 shows the amino acid sequence (SEQ ID NO:4) of the fusion 
protein APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of p-secretase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
20 cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of APP. 

Figure 6A-C shows the DNA sequence (SEQ ID NO:5) of the fusion 
protein APP(1-651)SW, K612V, GAL4-VP16(delMet) APP (664-695). 

Figure 7 shows the amino acid sequence (SEQ ID NO:6) of the fusion 
protein APP(1-651)SW, K612V, GAI4-VP16(delMet) APP (664-695) with the 
25 various different regions of the fusion protein demarcated. 1 = amino acids 1-651 of 
APP; 2 = region of P-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker, 8 = amino acids 664-695 
of APP. 

Figure 8 A-C shows the DNA sequence (SEQ ID N0:7) of the fusion 
30 protem APP(l-651)wt, K612V, GAL4-VP16(del Met) APP (664-695). 

Figure 9 shows the amino acid sequence (SEQ ID NO: 8) of the fusion 
protein APP(l-651)wt, K612V, GAL4-VP16(del Met) APP (664-695) with the 
various different regions of the fusion protein demarcated. 1 = anoino acids 1-651 of 
APP; 2 = region of p-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
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secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker, 8 = amino acids 664-695 
of APR 

Figure lOA-B shows the DNA sequence (SEQ ID N0:9) of the fusion 
protein APP(1-651)SW, TATexonI(MlL) AFP (664-695). 
5 Figure 1 1 shows the amino acid sequence (SEQ ID NO:10) of the 

fusion protein APP(1-65I)SW, TATexonI(MIL) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = TAT exon I; 7 = linker; 8 = amino adds 664-695 of 
10 APP. 

Figure 12A-B shows the DNA sequence (SEQ ID NO: 11) of the fusion 
protem APP(1.651)wt, TATexonI(MlL) APP (664-695). 

Figure 13 shows the amino acid sequence (SEQ ID NO: 12) of the 
fusion protein APP(l-651)wt, TATexonI(MlL) APP (664-695) with the various 
15 different regions of the fusion protein demarcated, 1 = amino acids 1-651 of APP; 2 = 
region of p-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker, 6 = TAT exon I; 7 = linker; 8 = amino acids 664-695 of 
APP. 

Figure 14A-C shows the DNA sequence (SEQ ID NO: 13) of the fusion 
20 protein APP(1-651)SW, GAL4-VP16(delMet) APP (664-695). 

Figure 15 shows the amino acid sequence (SEQ ID NO: 14) of the 
fusion protein APP(1-651)SW, GAL4-VP16(deIM:et) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of p-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
25 secretase cleavage; 5 = linker, 6 = GAL4-VP16; 7 = linker; 8 = amino adds 664-695 
of APP. 

Figure 16A-C shows the DNA sequence (SEQ ID NO: 15) of the fusion 
protein APP(l-651)wt, GAL4-VP16(delMet) APP (664-695). 

Figure 17 shows the amino acid sequence (SEQ ID NO: 16) of the 
30 fusion protein APP(l-651)wt, GAL4-VP16(delMet) APP (664-695) with the various 
different regions of the fusion protein demarcated. 1 = amino acids 1-651 of APP; 2 = 
region of P-secretase cleavage; 3 = wild-type K at position 612; 4 = region of y- 
secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APP. 
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Figure 18A-B shows the cDNA sequence (SEQ ID NO:17) and Figure 
18C shows the amino acid sequence (SEQ ID NO; 18) of the 695 amino acid splice 
variant of wild-type Alzheimer's precursor protein (APP). See GenBank accession 
no. Y00264 and Kang et al., 1987, Nature 325:733-736. 
5 Figure 19 shows data from an embodiment in which the assay of tiie 

present invention was used to identify both a P-secretase inhibitor and a y-secretase 
inhibitor. See Example 3 for details. 

Figure 20 shows a schematic diagram of pCR2.1 Gal4-VP16. 

Figure 21 A shows a schematic diagram of pRBR121. Figure 21B 
10 shows a schematic diagram of pcDNAS.l zeo (+) APP(1-651)SW, K612V- 
TATexonI(MlL) APP (664^95). 

Figure 22A shows a schenaatic diagram of pRBR186. Figure 22B 
shows a schematic diagram of the viral plasmid pNL4-3. Figure 22C shows a 
schematic diagram of pcDNAS.l zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) 
15 APP (664-695) with additional details as compared to Figure 21B, which shows the 
same plasmid. 

Figure 23 shows a schematic diagram of pRS V Kan/Neo res. 
Figure 24 shows a schematic diagram of pUCdSTAT. 
Figure 25 A shows a schematic diagram of pMM321. Figure 25B-D 
20 shows the nucleotide sequence of pMM321 . TTie upper strand is SEQ ID NO: 19. The 
lower strand (SEQ ID NO:20) is the reverse complement of SEQ ID NO: 19, 

Figure 26A shows a schematic diagram of the expression vector 
pcDNA3.1 zeo (+)APP(1-651)SW, K612V-(MlL)TATexonI. This expression vector 
directs the expression of a fusion protein containing the first 651 amino acids of APP 
25 with the Swedish version of the p-secretase cleavage site and the K612V mutation 
fused to the first exon of HIVl TAT. The methionine at position 1 of TAT has been 
changed to leucine. Figure 26B-G shows the nucleotide sequence of pcDNA3.1 zeo 
(+)APP(1-651)SW, K612V-CMlL)TATexonL The upper strand is SEQ ID NO:21. 
The lower strand (SEQ ID NO:22) is the reverse complement of SEQ ID NO:21. 
30 Figure 27A-B shows a schematic diagram depicting general features of 

the present invention. Figure 27 A: The vertical bar represents a fusion protein with 
APP sequences represented as unfilled or lightly shaded portions of the bar. The 
lightly shaded portion represents Ap. "B ACE" indicates the P-secretase cleavage site. 
The dark shaded portion represents the transcription factor fused between APP 
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sequences. The horizontal bar represents a membrane in which the uncleaved fusion 
protein is embedded, e.g., the endoplasmic reticulum. Figure 27B: The transcription 
factor (plus small amounts of APP), having been released from the fusion protein and 
thus the membrane by APP processing, is shown in the nucleus binding to and 
5 activating the regulatory DNA sequence ('Transcription Factor Response Element") 
that controls the expression of the reporter gene. 

Figure 28A-B shows the DNA sequence (SEQ ID NO:23) of the fusion 
protein APP(1-651)NFBV, K612V-TATexonI(MlL) APP (664-695). 

Figure 29 shows the amino acid sequence of a fusion protein (APP(1- 

10 651)NFEV, K612V-TATexonI(MlL) APP (664-695)) (SEQ K) NO:24) containing 
the sequence NFEV at the ^-secretase cleavage site (underlined at 2). The other 
portions of the fusion protein ace indicated as follows: 1 = amino acids 1-651 of APP; 
2 region of P-secietase cleavage; 3 = K612V mutation; 4 = region of y-secretase 
cleavage; 5 = linker, 6 = TAT exon I; 7 = linker, 8 = amino acids 664-695 of APP. 

15 Figure 30A-C shows the DNA sequence (SEQ ID NO:25) of the fusion 

protein APP(1-651)NFEV, K612V, GAU-VP16(delMet) APP (664-695). 

Figure 31 shows the amino acid sequence of a fusion protein (APP(1- 
651)NFEV, K612V, GAL4-VP16(delMet) APP (664-695)) (SEQ ID NO:26) 
containing the sequence NFEV at the P-secretase cleavage site (underlined at 2). The 

20 other portions of the fusion protein are indicated as follows: 1 = amino acids 1-651 of 
APP; 2 = region of P-secretase cleavage; 3 = K612V mutation; 4 = region of y- 
secretase cleavage; 5 = linker, 6 = GAL4-VP16; 7 = linker; 8 = amino acids 664-695 
of APP. 

Figure 32A shows a schematic diagram of pcDNA3.1 zeo (+), a 
25 eukaryotic expression vector that is suitable for use in the present invention. Hguie 
32B-F shows the nucleotide sequence of pcDNA3.1 zeo (+). The upper strand is SEQ 
ID NO:27. The lower strand (SEQ ID NO:28) is the reverse complement of SEQ ID 
NO:27. 

Figure 33 shows data from an embodiment of the present invention 
30 utilizing a p-galactosidase reporter gene in which the assay of the present invention 
was used to identify both a P-secretase inhibitor and a y-secretase inhibitor. See 
Example 8 for details. 

Figure 34 shows data from an embodiment of the present invention in 
which a fusion protein having a wild-type p-secretase cleavage site and a fusion 
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protein having a Swedish p-secietase cleavage site are compared. See Example 9 for 
details. 

Figure 35 A-B shows the DNA sequence (SEQ ID NO:29) of the fusion 
protein APP(1-651)NFEV, TATexonI(MlL) APP (664-695). 
5 Figure 36 shows the amino acid sequence of a fusion protein (APP(1- 

651)NFEV, TATexonI(MlL) APP (664-695)) (SEQ ID NO:30) containing the 
sequence NFEV at the P-secretase cleavage site (underlined at 2) and a wild-type K at 
position 612 (underlined at 3). The other portions of the fusion protein are indicated 
as follows: 1 = amino acids 1-651 of APP; 2 = region of P-secretase cleavage; 3 = 
10 wild-type K; 4 = region of y-secretase cleavage; 5 = linker; 6 = TAT exon I; 7 = 
linker; 8 = amino adds 664-695 of APP. 

Figure 37A-C shows the DNA sequence (SEQ ID NO:31) of the fusion 
protem APP(1-651)NFEV, GAI>t-VP16(delMet) APP (664-695). 

Figure 38 shows the amino acid sequence of a fusion protein (APP(1- 
15 651)NFEV, GAL4-VP16(delMet) APP (664-695)) (SEQ ID NO:32) containing the 
sequence NFEV at the p-secretase cleavage site (underlined at 2) and a wild-type K at 
position 612 (underlined at 3). The other portions of the fusion protein are indicated 
as follows: 1 = amino acids 1-651 of APP; 2 = region of P-secretase cleavage; 3 = 
wild-type K; 4 = region of y-secretase cleavage; 5 = linker; 6 = GAL4-VP16; 7 = 
20 linker; 8 = amino acids 664-695 of APP. 



DETAILED DESCRIPTION OF THE E^VENTION 
For the purposes of this invention: 

A 'fusion protein" is a protein that contains at least two polypeptide 
25 regions and, optionally, a linking peptide to operatively link the two polypeptides into 
one continuous polypeptide. Hie at least two polypeptide regions in a fusion protein 
are derived fix)m different sources, and therefore a fusion protein comprises two 
polypeptide regions not normally joined toother in nature. 

A "'linking sequence (or linker peptide)" contains one or more amino 
30 acid residues joined in peptide bonds. A linking sequence serves to join two 

polypeptide regions of differing origins in a fusion protein via a peptide bond between 
the linking sequence and each of the polypeptide regions. 

Typically, a fusion protein is synthesized as a continuous polypeptide 
in a recombinant host cell which contains an expression vector comprising a 
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nucleotide sequence encoding the fusion protein where the different regions of the 
fusion protein are fused in ftame on either side of a linker peptide's coding sequence. 
The chimeric coding sequence (encoding the fusion protein) is operatively linked to 
expression control sequences (generally provided by the expression vector) that are 
5 functional in the recombinant host cell. 

'^Reporter gene," as used in the present invention, does not mean a 
DNA sequence present on the chromosome of a cell, generally possessing introns, as 
is often meant by the word "gene" in the art. Rather "reporter gene" means any DNA 
sequence encoding a protein or polypeptide that can give rise to a signal that can be 

10 detected or measured '"Reporter gene" does not mean a portion of the amino acid 
sequence of APP. "TReporter gene" will usually mean a DNA sequence, generally a 
cDNA sequence (although in some cases a reporter gene may have introns) that 
encodes a protein or polypeptide that is conmionly used in the art to provide a 
measurable phenotype that can be distinguished over backgroimd signals. 

15 A "nuclear localization signal (NLS)" is a region of a polypeptide 

which targets the polypeptide to the nucleus of the cell. One such NLS is that from 
tiie SV40 large T antigen. See, e.g., U.S. Patent No. 5,589,392; Kalderon et al., 1984, 
Cell 39:499-509. The minimum region of tiie S V40 large T antigen with NLS activity 
is Pro-Lys-Lys-Lys-Arg-Lys-Val (SEQ ID NO:22). See also U.S. Patent No. 

20 5,776,689. 

"Substances" that are screened in the present invention can be any 
substances that are generally screened in the pharmaceutical industry during the drug 
development process. For example, substances may be low molecular weight organic 
compounds (e.g,, having a molecular weight of less than about 2,000 daltons and 

25 preferably less than about 1.000 daltons), RNA, DNA, antibodies, peptides, or 

proteins. Substances are often tested m the methods of the present invention as large 
collections of substances, e.g. libraries of low molecular weight organic compounds, 
peptides, or natural products. 

The conditions under which substances are employed in the methods 

30 described herein are conditions that are typically used in the art for the study of 

protein-ligand interactions or enzyme inhibition studies: e.g,y salt conditions such as 
those represented by such commonly used buffers as PBS or in tissue culture media; a 
temperature of about 4°C to about 55X; incubation times of from several seconds to 
several hours or even up to 24 or 48 hours. Screening for the identification of 
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enzyme-specific inhibitors is a well-known procedure in the pharmaceutical arts and 
the numerous conditions under which such screening has been done are available in 
the literature to guide the practitioner of he present invention. 

A "conservative amino acid substitution" refers to the replacement of 
5 one amino acid residue by another, chemically sinailar, amino acid residue. Examples 
of such conservative substitutions are: substitution of one hydrophobic residue 
(isoleucine, leucine, valine, or methionine) for another; substitution of one polar 
residue for another polar residue of the same charge (e.g., arginine for lysine; glutamic 
acid for aspartic acid); substitution of one aromatic amino acid (tryptophan, tyrosine, 

10 or phenylalanine) for anoth^. 

'Transfection'* refers to any of the methods known in the art for 
introducing DNA into a cell, eg., calcium phosphate or calcium chloride mediated 
transfection, electroporation, infection with a retroviral vector. 

The present invention relates to the discovery of an assay system that 

IS permits the simultaneous screening for inhibitors of several types of amyloid 
precursor protein (APP) processing or signaling (e.g., p-secretase cleavage, y- 
secretase cleavage, APP extracellular signaling, APP cytoplasmic signaling). In a 
preferred embodiment, this screening is accomplished without the concomitant 
identification of inhibitors of a-secretase. The assay system is carried out in a single 

20 type of cell, using a single type of assay readout. Inhibitors discovered by means of 
the present invention are expected to be useful in the treatment of Alzheimer's disease 
since these inhibitors are likely to be capable of interfering with the production of Ap. 

Previous assays for identifying inhibitors of APP processing have 
focussed specifically on inhibition of either p-secretase or y-secretase activity, or on 

25 inhibition of some other single aspect of AP production. In contrast, the assays 

described herein are directed to inhibition of APP processing in graeral. Substances 
identified throug^h these assays may target P-secretase, y-secretase, modulators of P- 
secretase or y-secretase activity, or even an as-yet-undiscovered ligand interaction 
with APP. In certain embodiments, these assays will also be firee of the potentially 

30 misleading or obscuring effects of ot-secretase activity. In addition, unlike other 
assays curcenfly in use, these assays are homogeneous assays; i.e., they require no 
cumbersome or time-consunung steps such as column chromatography separations, 
immunoprecipitations, washing steps, etc. Therefore, the assays are very well adapted 
to a high throughput screening format. 
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Ja the present invention, novel recombinant DNA molecules are 
constructed in which nucleotide sequences encoding at least a portion of the luminal 
(i.e., N-terminal to the transmembrane region) and transmembrane regions of APP are 
fused to nucleotide sequences encoding a transcription factor. In a prefeired 
5 embodiment, the APP contains an a-secretase cleavage site that has been altered to 
reduce or eliminate a-secretase cleavage. This allows the assays of the present 
invention to avoid identifying inhibitors of a-secretase and permits the more efficient 
detection of P-secretase inhibitors since a-secretase and P-secretase compete for APP 
cleavage. The recombinant DNA molecules may be transfected, along with a reporter 

10 gene, into a cell line that processes APP into Ap, and stable clones may be generated 
Alternatively, the recombinant DNA molecules and reporter plasmid may be utilized 
in transient transfections. 

Upon expression in cells, the APP/transciiption factor fusion protein 
localizes to a non-nuclear membrane of the cell (e.g., the endoplasmic reticulum) due 

IS to the presence of the APP sequences in the fusion protein. In a manner similar to 
cleavage of APP, the fusion protein will then be cleaved, first by P-secretase and then 
by Y-secretase. y-secretase cleavage releases the transcription factor from the 
membrane in which the APP/transcription factor fusion protein had been embedded, 
after which the transcription factor translocates to the nucleus and stimulates 

20 transcription of the reporter gene. Assuming no ocrsecretase cleavage, cleavage by 
both P-secretase and y-secretase is required for release of the transcription factor and 
transactivation of the reporter gene in this assay since y-secretase cleavage of APP is 
dependent on a short luminal domain, such as that generated by P- or a-secretase 
cleavage. Detection of a signal fix>m the reporter gene product will thus serve as 

25 evidence of APP processing. In particular, since activation of the reporter gene 

requires both P-secretase and y-secretase cleavage, the assay is capable of identifying 
inhibitors of both or either of these proteases. 

Hgure 27 is a schematic diagram depicting general features of the 
assay. The vertical bar in Figure 27A represents the fusion protein; the horizontal bar 

30 represents the non-nuclear membrane in which the fusion protein is embedded before 
processing. Figure 27B shows how the transcription factor portion of the fusion 
protein (with small amounts of the APP portion flanking it) has moved to the nucleus 
following release from the fusion protein by APP processing. In the nucleus, the 
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transcription factor is shown binding to a regulatory DNA sequence CTranscription 
Factor Response Element") and activating transcription of the reporter gene. 

The recombinant DNA molecules encoding the APP/transcription 
factor fusion protein and the reporter gene can be used to develop novel homogenous 
5 cell-based assays for the identification and assessment of inhibitors of AFP processing 
which will be readily amenable to high throughput technology. 

In one embodiment, the recombinant DNA molecules used in this 
invention comprise sequences encoding the amino terminal 651 amino acids of the 
695 amino add version of APP (Kang et al., 1987, Nature 325:733-736), including all 

10 the sequences necessary for the production of AP, as well as the C-terminal 32 amino 
adds of APP. The transcription factor is placed between the N-terminal and C- 
terminal portions of APP. The APP sequence may include a modification to increase 
the amount of ^-secretase cleavage of the fusion protein. This modification involves 
mutating the K at position 612 of the a-secretase cleavage site to a V (K612V). Since 

15 a-secretase and p-secretase compete for APP cleavage, reducing or elimmating APP 
cleavage by a-secretase results in increased 3-secretase cleavage, and allows the assay 
to detect P-secretase inhibitors more readily. In addition, the P-secretase cleavage site 
within APP (KMIDA) (SEQ ID NO:34) may be modified, e.g., to that of a naturally 
occurring mutation (termed the "Swedish" mutation or NLiDA) (SEQ ID NO:38) 

20 which has been shown to enhance P-secretase cleavage six-fold in cultured cells. 

Another possible modification is to replace the (KM^IdA) (SEQ ID NO:34) wild-type 
p-secretase cleavage site with the sequence (NPiEV) (SEQ ID NO:40). The presence 
of NFBV in an amino add sequence has been shown to enhance p-secretase cleavage 
by an even larger amount than the Swedish sequence. See U.S. Provisional Patent 

25 Application Serial No. 60/292,591 and U.S. Provisional Patmt Application Serial No. 
60/316,115, the disclosures of which are incorporated herein, in their entirety. 

In a preferred embodiment, HIV-1 TAT exon I has been fused between 
sequences encoding the first 651 amino acids of APP695 and the last 32 amino acids 
of APP695 (APP-TAT-APPct32). Co-transfection of an expression vector comprising 

30 this construct with a reporter gene plasmid containing an HV- 1 LTR promoter that 
controls the transcription of a reporter gene leads to enhanced expression of the 
reporter gene. Other transcription factors that could be fused to APPi-651 include 

Gal4-VP16, the entire Gal4 protein, BIV TAT, HIV-2 TAT, SIV TAT, LexA-VP16, 
EB V Zta, Papillomavirus E2, or tissue or species specific homodimeric bHLH 
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transciiption factors capable of activating transcription throu^ specific DNA 
response elements, such as E12, E47, or Twist. The use of GAL4, BIV, HIV-2, or 
SIV TAT may be useful if it is desired to reduce the potency of the transactivator, thus 
reducing any background transactivation caused by non-specific cleavage of the fusion 
5 protein. To further reduce the potential for transactivation by TAT in tiie absence of 
P-secretase and y-secretase cleavage, the TAT portion of the fusion protein may be 
altered to remove the N-terminal methionine and thus eliminate the possibility of 
aberrant translation of TAT through any potential internal ribosomal entry sites. 

In some circumstances, high level expression of TAT has been found 

10 to be toxic to cells. Thus, when TAT is the transcription factor fused to APP in die 
methods of the preset invention, it may be advantageous to utilize transient 
transfection with low amounts of the expression vector encoding the APP/IAT fusion 
protein. A set of preliminary experiments in which various amounts of the vector are 
transfected, in order to titrate acceptable levels of TAT, is recommended. 

15 The reports gene used will depend in large part upon the transoription 

factor fused to APP. The promoter used to drive the reporter gene will be LTR for 
TAT-based APP fusion proteins, or UAS (6x) for GAL4-VP16-based APP fusion 
proteins. In a particular embodiment, an LTR driving EGFP (enhanced green 
fluorescent protein, a brighter variant of GFP made by Aurora Biosciences, San 

20 Diego, CA) has been used to observe processing of an APP/TAT fusion protein. 
Under certain conditions, it may be desirable to use a less stable reporter, such as 
dsEGFP (a destabilized variant of EGFP made by Aurora Biosciences, San Diego, CA 
and marketed by Clontech, Palo Alto, CA) or a more potent reporter, such as p- 
lactamase. Alternatively, a stable HeLa cell line expressing LTR-p-galactosidase can 

25 be used. If the exquisite sensitivity of ^-lactamase nudces it less than optimal for a 
particular purpose, the LTR-^-galactosidase cell line may be exploited for this assay. 
Finally, under some circumstances Gal4-VP16 may prove to be optimal relative to 
TAT to reduce any inherent background problems associated with using the weakly 
but constitutively active LTR in the reporter plasnud, in which case the reporter 

30 plasmid could be UAS(6x)-P-lactamase (Aurora Biosciences, San Diego, CA). 

A variety of cells are suitable for use in the methods of the present 
invention. Particularly preferred are eukaryotic, especially mammalian, cell lines. In 
particular embodiments, the cells are selected from the group consisting of: L cells L- 
M(TK-) (ATCC CCL 1.3), L cells L-M (ATCC CCL 1.2), HEK293 (ATCC CRL 
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1573), HEK293T, Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 
1650), COS-7 (ATCC CRL 1651), CHO^Kl (ATCC CCL 61), 3T3 (ATCC CCL 92), 
NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS- 
CA (ATCC CCL 26), T24 (ATCC HTB-4), PC12 cells, Jurkat cells, H4 ceUs (ATCC 
5 HTB-148), and MRC-5 (ATCC CCL 171). 

To make the assay more amenable for ultra-high throughput screening, 
a non-adherent cell line, such as Jurkat, can be used. 

Generally, the assays of the present invention employ cells that 
naturally express p-secretase and y-secretase. However, it is possible to practice the 

10 invention in cells that lack the expression of one, or both, of these enzymes. In such 
cases, P-secretase and y-secretase activity can be provided by the lecombmant 
expression of these enzymes in the cells. 

In one embodiment, the present invention provides a recombinant cell, 
prefi^ably a eukaryotic cell, even more preferably a mammalian cell, and most 

15 preferably a human cell, where the cell expresses a fusion protein of APP and a 
transcription factor and the cell contains a reporter gene that can be activated by the 
transcription factor. The fusion protein comprises a portion of APP wh^ that portion 
includes the regions of the P-secretase and y-secretase cleavage sites fused to a 
transcription factor. The region of APP including the p-secretase and y-secretase 

20 cleavage sites can be, e.g., a portion of APP that includes amino acids 589-651 of the 
695 amino add version of APP. This region is shown below. 

i i i 

EEISEVKM DAEFRHDSGYEVHHQKLVFFAEDVGSNKGA^GL^IVGGW lA 
25 TVIVirLVMLKKK (SEQ ID NO:33) 

i 

The P-secretase cleavage site is shown at position 596-597 (KM DA) (SEQ ID 
NO:34). 

Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
30 638-639 

i i 

(GW lA TV)(SEQIDNO>35). 

The fusion protein will be anchored in the membrane by the APP 
35 sequences shown above. The N-terminal portion of APP must include at least the P- 
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secretase cleavage site, and possibly several amino-acids N-terminal to the P-secretase 
cleavage site to make the assay sensitive to both p-secietase and y-secretase inhibitors. 
In many cases, the APP sequences will include sequences fijrther N-tenninal than 
those shown above, including the signal sequence at the N-tenninus of APP. In cases, 

5 where the APP signal sequence is not used, another signal sequence may be 

incorporated in the fusion protein. Such other signal sequences are known in the art. 

In a related embodiment, the present invention provides a recombinant 
cell, preferably a eukaryotic cell, even more preferably a mammalian cell, and most 
preferably a human cell, where the above-described APP portion of the fusion protein 

10 contains a K612V mutation. The APP portion of this embodiment is shown below. 

i i i 

EEESEVKM DAEFRHDSGYEVHHQYLVEFAEDVGSNKGAnGI^ lA 
TVIVnLVMLKKK (SEQ ID NO:36) 

15 i 

The P-secretase cleavage site is shown at position 596-597 (KM DA) (SEQ ID 
NO:34). 

Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
638-639 

20 i i 

(GW lA TV)(SEQIDNO:35), 
The underlined V at position 612 shows the change in sequence in the present 
invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by a-secretase. 

25 In a related embodiment, the present invention provides a recombinant 

cell, preferably a eukaryotic cell, even more preferably a mammalian cell, and most 
preferably a human cell, where the above-described APP portion of tiie fusion protein 
contains the Swedish version of the P-secretase cleavage site as well as a K612V 
mutation. The APP portion of this embodimrat is shown below. 

30 

i i i 

EEISEVNL DAEE5lEIDSGYEVHHQYLVEFAEDVGSNKGAnGI^ lA 
TVaVntYMLEOKK (SEQ ID NO:37) 

i 

35 The P-secretase cleavage site is shown at position 596-597 (ML DA) (SEQ ID 
NO:38). 
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Two predominant cleavage sites of y-seotetase are shown at positions 636-637 and 
638^39 

i i 

(GW lA TV)(SEQIDNO:35). 
5 The underlined V at position 612 shows the change in sequence in the present 

invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by Orsecretase. 

In a related embodiment, the present invention provides a recombinant 
cell, piefembly a eukaryotic cell, even more preferably a mammalian cell, and most 
10 preferably a human cell, where the above-described APP portion of the fusion protein 
contains the NE^V version of the P-secretase cleavage site as well as a K612V 
mutation. The APP portion of this embodiment is shown below. 

I i i 

15 EEISEVNF EVEFRHDSGYEVHHQVLVEFAEDVGSNKGAnGU^C^ lA 
TVIVnLVMLKKK (SEQ ID NO:39) 

I 

The p-secietase cleavage site is shown at position 596-597 (NF EV) (SEQ ID 
NO:40). 

20 Two predominant cleavage sites of y-secretase are shown at positions 636-637 and 
638-639 

i i 

(GW lA TV)(SEQIDNO:35). 
The underlined V at position 612 shows the change in sequence in the present 

25 invention from the wild-type K to the mutant V, which change provides for reduced 
cleavage by a-secretase. 

The presence of both P-secretase and y-secretase cleavage sites in the 
fusion proteins permits the assays of the present invention to detect inhibitors of both 
p-secretase and y-secretase. 

30 The recombinant host cells of the presrat invention can be further 

engineered to comprise a reporter gene construct. Hie reporter gene construct 
contains a reporter gene in operable linkage with a regulatory DNA sequence that 
confers on the reporter gene the property of being regulated by the transcription factor 
of the fusion protein. This regulation is such that expression of the reporter gene is 

35 low or absent without binding of the transcription factor to the regulatory DNA 
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sequence but, when the transcription factor is released from the fusion protein by APP 
processing, the transcription factor can move into the nucleus of the cell and bind to 
the regulatory DNA sequence, thereby activating transcription from the reporter gene. 
Reporter genes desirably give rise to gene products which can be 
5 detected or quantitated, either in terms of amount of protein synthesized, enzymatic 
activity, fluorescence, luminescence, or some other phenotype. Suitable reporter gene 
products include firefly luciferase (de Wet et al., 1987, Mol. Cell. Biol. 7:725-737) or 
bacterial luciferase (Englebrecht et al., 1985, Science 227:1345-1347; Baldwin et al., 
1984, Biochem. 23:3663-3667), ^-lactamase, P-glucuronidase, P-galactosidase, green 

10 fluorescent proteins, enhanced green fluorescent protein, destabilized enhanced green 
fluorescent protein, red fluorescent protein, yellow fluorescent protein, cyan 
fluorescent protein, destabilized yellow fluorescent protein, destabilized cyan 
fluorescent protein, aequorin, chloramphenicol acetyl transferase (Alton & Vapnek, 
1979, Nature 282:864-869), rat Uver alkaHne phosphatase (Toh et al„ 1989, Bur. L 

15 Biochem. 182:231-237), human placental secreted alkaline phosphatase (Cullen & 
Mallim, 1992, Meth. Enzymol. 216:362-368), and horseradish peroxidase, among 
others. 

A preferred reporter gene is green fluorescent protein (GFP) or a 
modified GFP. Wild-type GFP has long been used in the art. Starting from green 

20 fluorescent protein, many modified versions have been derived with altered or 

enhanced spectral properties as compared with wild-type GPP. See, e.g., U.S. Patent 
No. 5,625,048; International Patent Publication WO 97/28261; Jhtemational Patent 
PubUcation WO 96/23810. Useful are the modified GFPs WIB and TOPAZ, 
available commercially from Aurora Biosciences Corp., San Diego, CA. WIB 

25 contams the following changes fix)m the wild-type GEP sequence: F64L, S65T, 

Y66W, N146I, M153T, and V163A (see Table 1. page 519, of Tsien, 1998, Ann. Rev. 
Biochem. 67:509-544). TOPAZ contains the following changes from the wild-type 
GFP sequence: S65G, V68L, S72A, and T203Y (see Table 1, page 519, of Tsien, 
1998, Aim. Rev. Biochem. 67:509-544). Wild-type nucleotide and amino acid 

30 sequences of GFP are shown in Figure 1 and SEQ ID NO: 1 of International Patent 
Publication WO 97/28261; in Figure 1 of Tsien, 1998, Ann. Rev. Biochem, 67:509- 
544; and in Prasher et al., 1992, Gene 1 1 1:229-233. 

When expressing GK^s in mammalian cells, it may be advantageous to 
constract versions of the GFPs having altered codons that conform to those codons 



-20- 



wo 03/072041 



PCTAJS03/05458 



preferred by mammalian cells (Zolotukhin et al., J. Virol. 1996, 70:464646754; Yang 
et al., 1996, Nucl. Acids Res. 24:4592-4593). Another way of improving GFP 
expression in mammalian cells is to provide an optimal ribosome binding site by the 
use of an additional codon inomediately after the starting methionine (Cramen et al., 
5 1996, Nature Biotechnology 14:315-319). 

Transcription factors that are useful in the present invention are 
preferably those transcription factors that are not naturally expressed in the 
recombinant host cells. This is so the regulatory DNA sequence is not activated 
absent APP processing and release of the transcription factor firom the fusion protein. 

10 Preferably, the transcription factor contains, or is engineered to contain, a nuclear 
localization signal. This is so that, after release, the transcription factor will move 
into the nucleus of the genetically modified host cells where it can bind to, and 
activate, the regulatory DNA sequence, leading to expression of the reporter gene. 
Transcription factors, as used in the present invention, do not include proteins that, 

15 after release from a fusion protein and translocation into the nucleus, repress 
transcription fix)m a reporter gene. 

Among the transcription factors that are useful in the present invention 
are: HIVl TAT (in particular exon I of HIVl TAT), Gal4-VP16, the entire Gal4 
protein, BIV TAT, HIV-2 TAT, SIV TAT, LexA-VP16, EBV Zta, Papillomavirus E2, 

20 or one of the bHLH homodimeric transcription factors, E12, E47, or Twist. 

Expression vectors are generally used to express the fusion protein in 
the recombinant cells. An expression vector contains recombinant nucleic acid 
encoding a polypeptide (e.g., an APP/transcriptiion factor fusion protein) along with 
regulatory elements for proper transcription and processing. Generally, the regulatory 

25 elements that are present in an expression vector include a transcriptianal promoter, a 
ribosome binding site, a transcriptional terminator, and a polyadenylation signal. 
Other elements noiay include an origin of replication for autonomous replication in a 
host cell, a selectable marker, a limited number of useful restriction enzyme sites, and 
a potential for high copy number. 

30 A variety of expression vectors are known in the art and can be used in 

flie present invention. Commercially available expression vectors which are suitable 
include, but are not limited to, pMClneo (Stratagene), pSG5 (Stratagene), pcDNAI 
and pcDNAIamp, pcDNA3, pcDNA3.1, pCR3.1 (hivitrogen, San Diego, CA), EBO- 
pSV2-neo (ATCC 37593), pBPV-l(8.2) (ATCC 37110), pdBPV-MMTneo(342-12) 
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(ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pCLneo 
(Promega), pTRE (Clontech, Palo Alto, CA), pVlJneo, pIRESneo (Qontech, Palo 
Alto, CA), pCEP4 (Invitrogen, San Diego, CA), pSCll, and pSV2~(lhfi: (ATCC 
37146). The choice of vector will depend upon the cell type in which it is desired to 

5 express the APP/transcription factor fusion protein, as well as on the level of 
expression desired, and the like. 

The expression vectors can be used to transiently express or stably 
express the fusion protein. The transient expression or stable expression of 
transfected DNA is well known in the art. See, e.g., Ausubel et al., 1995, 

10 'Introduction of DNA into mammalian cells," in Current Protocols in Molecular 
Biology, sections 9.5.1-9.5.6 (John Wiley & Sons, Inc.). 

The lecombinant host cells of the present invention are useful in 
methods of screening substances for the ability to inhibit APP processing. In one 
embodiment, the methods of the present invention comprise adding a candidate 

15 substance to a recombinant host cell comprising an APP/transcription factor fusion 
protein and a reporter gene and comparing the level of expression of the reporter gene 
protein in the presence and absence of the candidate substance, wherein the level of 
expression of the reporter gene protein is lower when the candidate substance inhibits 
processing of the APP/transcription factor fusion protein such that the transcription 

20 factor is not released, or is released in a lower amount, than in the absence of the 
substance. 

The level of expression of the reporter gene protein is generally not 
measured directly. Rather, an indirect method is used. For example, fluorescence 
given off by the reporter gene protein may be detected or measured as, e.g., when the 

25 reporter gene product is a grera fluorescent protdn; or, some enzymatic activity of the 
reporter gene product may be detected or measured, e.g., when the reports gene 
product is ^-lactamase. 

The candidate substance may be of any form suitable for entry into the 
cytoplasm of the recombinant cell or for contact with the cellos cytoplasmic 

30 membrane. Under appropriate conditions, the candidate substance may be allowed to 
fteely diffuse mto the cell, or the delivery of the substance may be facilitated by 
techniques and substances which enhance cell permeability, a wide variety of which 
are known in the art. Methods for increasing cell permeability include, without 
limitation, the use of organic solvents such as dimethylsulfoxide, liposomes. 
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application of electrical current, and physical means such as substance-coated teflon 
pellets. 

The present invention provides a method of identifying a substance that 
inhibits APP processing comprising: 
5 (a) providing a recombinant eukaryotic cell which: 

(i) expresses a fusion protein comprising amino acids 589- 
651 of APP695 and a transcription factor where flie transcription factor is fused in 
frame to the caiboxyl terminus of amino acids 589-651 of APP695; and 

(ii) comprises a reporter gene operably linked to a regulatory 
10 DNA sequence which is capable of being activated by the transcription factor, 

(b) measuring the level of reporter gene product in the cell in the 
absence of the substance; 

(c) adding the substance to the cell and measuring the level of 
reporter gene product in the cell in the presence of the substance; 

15 where a decrease in the level of reporter gene product in the presence 

as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

The manner in which the level of the reporter gene product is measured 
will be determined by the nature of the reporter gene and, often, the characteristics of 

20 the host cell. For example, if the reporter gene product itself is fluorescent, as for 
example, when a green fluorescent protein is the reporter gene product, fluorescence 
from the cell can be measured directly. When the reporter gene product has enzymatic 
activity, for example, when the reporter gene product is ^-lactamase, known methods 
of measuring that enzymatic activity can be used. 

25 For the sake of clarity, the above method is described in teems of "a" 

cell. In actual practice, the method will generally be carried on a large number of cells 
at one time. For example, the method will often be carried out in a well of a tissue 
culture plate, where, depending on the number of wells in the plate (and thus thebr 
size), there can be up to hundreds, thousands, or even several million cells. The step 

30 of "adding the substance to the cell" is generally carried out by simply adding the 
substance to the tissue culture medium in which the cells are present. After the 
substance is added to the cell, the cell and ttie substance are incubated for a period of 
time sufficient for the substance to inhibit APP processing, if the substance is actually 
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an inhibitor of APP processing. This praiod is usually from about 15 minutes to 48 
hours, but may be somewhat more in unusual cases. 

A convenient way of carrying out the method is to grow a population 
of the recombinant eukaryotic cells and then split the population into a portion that 
5 will be exposed to the substance and a portion that will not be exposed to the 
substance. 

The recombinant eukaryotic cell is generally produced by transf ection 
of an expression vector encoding the fusion protein and by transfection of a plasmid 
containing the reporter gene. 

10 One skilled in the art would recognize that what is sought in terms of 

"a decrease in the level of reporter gene product in the presence as compared to the 
absence of the substance" is a non-trivial decrease. For example, if in the method 
described above there is found a 1% decrease, this would not indicate that the 
substance is an inhibitor of APP processing. Rather, one skilled in the art would 

15 attribute such a small decrease to normal experimental variance. What is looked for is 
a significant decrease. For the purposes of this invention, a significant decrease 
fulfills the usual requirements for a statistically valid measurement of a biological 
signal. For example, depending upon the details of the embodiment of the invention, 
a significant decrease might be a decrease of at least 10%, preferably at least 20%, 

20 more preferably at least 50%, and most preferably at least 90%. 

In particular embodiments, amino acids 589-651 of APP695 contain a 

K612V mutation. 

In particular embodiments, the cell is a mammalian cell. In particular 
embodiments, the cell is a human cell. 
25 In particular embodiments, the method is used to screen a library of 

more than 1,000 substances. In other embodimrats, the method is used to screen a 
library of more than 50,000 substances at a rate of more than 1 ,000 substances per 24 
hours. 

In particular embodiments, the fusion protein comprises a portion of 
30 APP that is selected firom the group consisting of: amino acids 1-65 1 of APP695» 
amino acids 50-651 of APP695, amino acids 100-651 of APP695, amino acids 150- 
651 of APP695, amino acids 200-651 of APP695, amino acids 250-651 of APP695, 
amino acids 300-651 of APP695, amino acids 350-651 of APP695, amino acids 400- 
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651 of APP695, amino acids 450-651 of APP695, amino acids 500-651 of APP695. 
and amino acids 550-651 of APP695. 

In related embodiments, the fusion protein does not comprise all of 
amino acids 589-651 of APP695. Rather, the fusion protein comprises slightly fewer 

5 amino acids from APP. For example, the fusion protein might comprise slightly fewer 
amino acids of the P-secretase cleavage site: e.g., amino acids 590-651 of APP695- 

Or the fusion protein might comprise slightly fewer amino acids of the y-secretase 
cleavage site: amino acids 589-650 of APP695; amino acids 589-649 of APP695; 
amino adds 589-648 of APP695; or amino adds 589-647. The fusion protein mscy 

10 even comprise slightly fewer amino adds from both ends, e.g., amino adds 590-647 
of APP695. What is important is that the portion of APP included in the fusion 
protein contains both the P-secretase cleavage site and the y-secietase cleavage site. 

In particular embodiments, the transcription factor is selected from the 
group consisting of: HIV-1 TAT, Gal4-VP16, the entire Gal4 protein, LexA-VP16, 

15 EBV Zta, Papilloxnavirus E2, one of the bHLH homodimeric transcription factors, 
including E12, E47, or Twist, or BIV TAT, inV-2 TAT, or SIV TAT. A particular 
version of HIV-1 TAT suitable for use in the present invention is HIV-1 TAT exon L 

Fusion proteins suitable for use in the present invention can be selected 
from the group consisting of: APP(1-651)SW, K612V-TATexonI(MlL) APP (664- 

20 695) (SEQ ID NO:2); APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) (SEQ 
ID N0:4); APP(1-651)SW, K612V, Gal4-VP16(M1L) APP (664-695) (SEQ ID 
N0:6); APP(l-651)wt, K612V, Gal4-VP16(M1L) APP (664-695) (SEQ ID N0:8); 
APP(1-651)SW, TATexonI(MlL) APP (664-695) (SEQ ID NO:10); APP(l-651)wt, 
TATexonI(MlL) APP (664-695) (SEQ ID NO: 12); APP(1-651)SW, Gal4- 

25 VP16(M1L) APP (664-695) (SEQ ID N0:14); APP(l-651)wt, Gal4-VP16(M1L) APP 
(664-695) (SEQ ID NO: 16); APP(1-651)NEEV, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID NO:23); and APP(1-651)NFEV, K612V, GAL4-VP16(M1L) APP 
(664-695) (SEQ ID NO:25). 

In some embodiments of the present invention, the amino acid 

30 sequences contributed to the fusion protein by the transcription factor constitute the 
caiboxy terminal amino acid sequences of the fusion protein. In other embodiments, 
the transcription factor has other sequences fused to its carboxy terminus, as in the 
examples herein where amino acids 664-695 of APP695 are fused to the carboxy 
terminus of the transcription factor and therefore constitute the carboxy terminal 
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amino acid sequences of the fusion protein. Other portions of APP (e.g., amino acids 
652-695 of APP695) could be used instead of amino acids 664-695 of APP695- In 
fact, it should be possible to extend the caiboxy tenninus of the transcription factor 
with almost any amino acid sequences, providing such sequences do not interfere with 
5 the ability of the transcription factor to move into the nucleus and activate 

transcription of the reporter gene once the transcription factor has been released from 
the fusion protein by the action of y-secretase. 



sequence from APP that is capable of bemg cleaved by both ^-secietase and y- 
secietase and a transcription factor where the transcription factor is fused in frame to 
the amino acid sequence from APP; and 



regulatory DNA sequence which capable of being activated by the transcription factor, 

(b) measuring the level of reports gene product in the cell in the 
absence of the substance; 

(c) adding the compound to the cell and measuring the level of 
20 reporter gene product in the cell in the presence of the substance; 

where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

In particular embodiments, the amino acid sequence from APP 

25 comprises: 



10 



The present invention includes a method of identifying a substance that 
inhibits APP processing comprising: 

(a) providing a recombinant eukaryotic cell which: 

(i) expresses a fusion protein comprising an amino acid 



15 



(ii) comprises a reporter gene operably linked to a 



30 



589- 651 of APP695i 

590- 651 of APP695; 

589- 650 of APP695; 

590- 650 of APP695; 

589- 649 of APP695; 

590- 649 of APP695; 

589- 648 of APP695; 

590- 648 of APP695: 
589-647 of APP695; 



or 
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590-647 of APP695. 

In related embodiments, the amino acid sequence from APP contains 
the amino acid sequence NLDA (SEQ ID NO:38) at the P-secretase cleavage site 
instead of the wild-type sequence KMDA (SEQ ID NO:34). 
5 In related embodiments, the amino acid sequence from APP contains 

the amino acid sequence NFEV (SEQ ID NO:40) at the p-secretase cleavage site 
instead of the wild-type sequence KMDA (SEQ ID NO:34). 

The portion of the fusion protein that is derived from APP may contain 
mutations that are known in the art. Of particular interest are mutations that result in 
10 an increased proportion of AP being made in the form of Api-42 rather than Api-40. 
Such mutations are disclosed in the following publications (numbering is from the 
770 amino acid version of APP): 

Swedish CK670N, M671L): MuUan et al., 1992, Nature Genet. 1:345-347. 

Hemish (A692G): Hendiiks et al., 1992, Nature Genet 1:218-221; Cras et al., 1998, 
15 Acta Neuropafliol. (Berlin) 96:253-260. 

Dutch (E693Q): Levy et al., 1990, Science 248:1124-1126. 

Arctic (E693G): Nilsberth et al., 2001, Nature Neuroscience 4: 887-893. 

Austrian (T714I): Kumar-Singh et al., 2000, Hum. Mol. Genet. 9:2589-2598. 

French (V715M): Ancolio et al., 1999, Proc. Natl. Acad. Sci. (USA) 96:4119-4124. 
20 Florida (ni6V): Eckman et al., 1997, Hum. Mol. Genet. 6:2087-2089. 

V717F: MuneU et al., 1991, Science 254:97-99. 

V717G: Chartier-Harlin et al., 1991, Nature 353:844-846. 

London (V717I): Goate et al., 1991, Nature 349:704-706. 

L723P: Kwok et al., 2000, Ann. Neurol. 47:249-253. 
25 r716F (also called I45F, referring to flie position relative to the p-secretase cleavage 

site): This mutation in APP changes processing of AP almost exclusively to Api-42. 

Lichtenthaleret al., 1999, Proc. Nati. Acad. Sci. (USA) 96:3053-3058. 

As with many proteins, it may be possible to modify many of the 

amino acids of the fusion proteins described above and still retain substantially the 
30 same biological activity in terms of APP processing as for the original fusion protein. 

Thus, the present invention includes modified fusion proteins which have amino acid 

deletions, additions, or substitutions but that still retain substantiaUy the same 

properties with respect to APP processing as the fusion proteins described herein. It is 

generally accepted that single amino acid substitutions do not usually alter the 
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biological activity of a protein (see, e.g., Molecular Biology of the Gene , Watson et 
al., 1987, Fourth Ed., The Benjamin/Cummings Publishing Co., Inc., page 226; and 
Cunningham & Wells, 1989, Science 244:1081-1085). Accordingly, the present 
invention includes fusion proteins where one amino acid substitution has been made 
5 in the fusion proteins described herein where the fusion proteins still retain 

substantially the same properties with respect to APP processing as the fusion proteins 
described herein. The present invention also includes fusion proteins where two or 
more amino acid substitutions have been made in the fusion proteins described herein 
where the fusion proteins still retain substantially the same properties with respect to 
10 APP processing as the fusion proteins described herein. In particular, the present 
invention includes embodimrats where the substitutions are conservative 
substitutions. 

With the exception of Figure 18, the nucleotide and amino acid 
sequences of APP disclosed herein contain a minor difference compared to APP 

IS sequences that are usually reported in the literature. For the sequences disclosed 

herein with such a difference, the nucleotide at position 367 is an A rath«: than a G, as 
in most published APP sequences. This change results in a conservative substitution 
in the corresponding APP amino acid sequence. Thus, the amino acid sequences 
disclosed herein with such a difference have an I rather than a V at position 123. This 

20 difference does not affect the properties of the fusion proteins for the purposes of the 
present invention. Therefore, fusion proteins having the APP sequence reported in the 
literature with an G at nucleotide position 367 and a V at amino acid position 123 and 
the fusion proteins disclosed herein with an A at nucleotide position 367 and an I at 
amino acid position 123 are to be considered equivalents for the purposes of the 

25 present invention. 

The Gal-VP16 sequences disclosed herein contain two changes from 
the usual published sequences. There is T to C change at nucleotide position 2131 
that causes a S to P change at amino acid position 712; there is A to C change at 
nucleotide position 2301 that does hot change die amino add sequence. It is expected 

30 tiiat Gal-VP16 proteins containing the usual sequences reported in the literature will 
also be suitable for use in the present invention. 

The methods of the present invention can be used to screen libraries of 
substances or other sources of substances to identify substances that are inhibitors of 
P-secretase or y-secretase. Such identified inhibitory substances can serve as "leads" 
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for the development of phannaceuticals that can be used to treat patients having 
Alzheimer*s disease or in a prophylactic manner to prevent or delay the development 
of Alzheimer's disease. Such leads can be further developed into pharmaceuticals by, 
for example, subjecting the leads to sequential modifications, molecular modeling, 
5 and other routine procedures employed in the pharmaceutical industry. The inhibitors 
of APP processing identified by the present invention may also be tested in animal 
models of Alzheimer's disease such as the various transgenic mouse models that are 
known in the art. 

Although a wide variety of substances can be screened by the methods 

10 of the present invention, preferred substances for screening are libraries of small 
molecule compounds. Small molecule compounds are preferred because they are 
more readily absorbed after oral administration, have fewer potential antigenic 
determinants, and are more likely to cross the blood/brain barrier than larger 
molecules such as nucleic acids or proteins. 

IS Once identified by the methods of the present invention, the candidate 

small molecule compounds may then be produced in quantities sufficient for 
pharmaceutical testing and formulated in a pharmaceutically acceptable carrier (see, 
e.g., Reminjgton's Pharmaceutical Sciences , Gennaro, A., ed.. Mack PubUshing, 1990, 
for suitable methods). The candidate compounds may be administered to cell lines 

20 relevant to Alzheimer' s disease, animal models of Alzheimer's disease, or 
Alzheimer's disease patients. 

The numbering of the amino acids in APP used herein is based on the 
695 amino acid version of APP described in Kang et al., 1987, Nature 325:733-736. 
There are two other major versions of APP, having 751 amino adds and 770 amino 

25 acids (see, Ponte et al., 1988, Nature 331:525-527 for the 751 anoino acid version and 
Kitaguchi et al., 1988, Nature 331:530-532 for the 770 amino acid version). One 
skilled in the art will understand how to translate the nmribering used herein, based on 
the 695 amino acid version of APP, into the corxesponding numbering for other 
versions of APP. For example, some of the APP/transcription factor fusion proteins 

30 of the present invention contain the K612V mutation, based on the numbering of the 
695 amino acid version. This mutation would correspond to a K668V mutation in the 
751 amino acid version and a K687V mutation in the 770 amino acid version. 
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Therefore, when a "K612V" mutation is referred to herein, it will be 
understood that such reference also includes a K668V mutation of the 751 amino acid 
version of APP as well as a K687V mutation of the 770 amino acid version of AFP. 

Similarly, the portion of APP referred to as APPl-651 herein, based on 
5 the 695 amino add version, will be understood to mean also APPl-707 of the 751 
amino acid version and APPi-726 of the 770 amino acid version. 

If desired, inhibitors that are identified by the methods of the present 
invention can be further tested to determine which step in APP processing they affect. 
Assays that are known to be specific for the various steps of APP processing can be 

10 used for this purpose. For example, the assay of Karlstr5m et al., (Journal of 

Biological Chemistry papers in press, published on December 13, 2001 as Manuscript 
C100649200) is only capable of detecting inhibitors of y-secretase and cannot also 
detect inhibitors of other steps of APP processing such as, e.g., inhibitors of 
secretase. If an inhibitor identified by the methods of the present invention is found to 

15 also be an inhibitor when tested in the assay of Karlstrom et al., then that inhibitor is 
at least a y-secretase inhibitor. It is still possible that that inhibitor could inhibit other 
steps in APP processing as well. Further tests known in the art can determine this. 

The present invention may be modified so as to provide methods of 
determining at which step of APP processing a known inhibitor of APP processing 

20 exerts its effect. The known inhibitor may be one that has been identified by the 
methods of the present invention or by some other method. The modification to the 
present invention consists in mutating the P-secretase site in a fusion protein so that P- 
secretase cleavage can no longer occur at the site or occurs at a very much reduced 
level. Providing that the fusion protein contains a cleavable o&-secretase site, the 

25 fusion protein can still be used in the methods of the preset invention. However, this 
fusion protein (with a mutated P-secretase site) can no longer detect ^secretase 
inhibitors. Hierefore, if the known APP processing inhibitor still functions as an APP 
processing inhibitor in this modified version of the invention, then the known 
inhibitor cannot be a P-secietase site inhibitor but instead must exert its effect 

30 downstream of ^-secretase. 

Suitable mutations of the P-secretase site include the following. All 
the sequences are for amino acid positions 594-598 of APP695- 
VNFAV (SEQ ID N0:41): This mutation shows decreased P-secretase cleavage 
relative to the wild type, KMDA (SEQ ID NO:34), sequence. 
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VKVDA (SEQ JD NO:42): Vassar et al., 1999, Science 286:735-741. This mutant 
was tested in vitro only, but purified P-secretase failed to cleave a 30-amino acid 
peptide containing this sequence. 

WKMDA (SEQ ID NO:43), VKADA (SEQ ID NO:44), VKKDA (SEQ ID NO:45), 
5 VKEDA (SEQ ID NO:46), VKIDA (SEQ ID NO:47), VKMIA (SEQ ID NO:48), 
VKMNA (SEQ ID NO:49), VKMEA (SEQ ID NO:50), VKMDE (SEQ ID NO:51), 
VKMDK (SEQ ID NO:52): Citron et al., 1995. Neuron 14:661-670. These mutations 
decreased AP production 4-20X relative to p3 production in cultured cells. 

Fusion proteins can be constructed by use of the polymerase chain 

10 reaction (PGR) to amplify desked portions of APP and transcription factors, which 
can be then be cloned into expression vectors by methods well known in the art 
Primes for PGR will generally include a small part of the APP or transcription factor 
as well as convenient cloning sites and/or linker peptide sequences. The PGR primers 
can be used to amplify the desired APP or transcription factor fragments from sources 

15 such as previously cloned APP or transcription factors, cDNA libraries, or genomic 
libraries. The amplified APP and transcription factor sequences can be cloned into 
suitable expression vectors. Methods of PGR and cloning are well known in the art 
and can be found in standard reference materials such as those listed below. 

Standard techniques for cloning, DNA isolation, amplification and 

20 purification, for enzymatic reactions involving DNA ligase, DNA polymerase, 

restriction endonucleases and the like, and various separation techniques are known 
and commonly employed by those skilled in the art. A number of standard techniques 
are described in Sambrook et al. (1989) Molecular Gloning, Second Edition, Gold 
Spring Harbor Laboratory, Plainview, N.Y.; Maniatis et al. (1982) Molecular Gloning, 

25 Gold Spring Harbor Laboratory, Plainview, N. Y.; Wu (ed) (1993) Meth, EnzymoL 
218, Part I; Wu (ed) (1979) Meth. Enzymol. 68; Wu et al. (eds.) (1983) Meth. 
Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth. Enzymol. 65; Miller 
(ed.) (1972) Experiments in Molecular Genetics, Gold Spring Harbor Laboratory, 
Gold Spring Harbor, N.Y.; Old and Primrose (1981) Principles of Gene Manipulation, 

30 University of California Press, Berkeley; Schleif and Wensink (1982) Practical 

Methods in Molecular Biology; Glover (ed.) (1985) DNA Gloning Vol. I and H, IRL 
Press, Oxford, UK; Hames and Higgins (eds.) (1985) Nucleic Acid Hybridization, IRL 
Press, Oxford, UK; Setlow and HoUaender (1979) Genetic Engineering: Principles 
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and Methods, Vols. 1-4, Plenum Press, New Yoric, and Ausubel et al. (1992) Currmt 
Protocols in Molecular Biology, GieeneAViley, New York, N.Y.. 

PGR reactions can be carried out with a variety of thermostable 
enzymes including but not limited to AmpliTaq, AmpliTaq Gold, or Vent polymerase. 

5 For AmpliTaq, reactions can be carried out in 10 mM Tris-Cl, pH 8.3, 2.0 mM 
Mga2, 200 fiM of each dNTP, 50 mM KQ, 0.2 fiM of each primer, 10 ng of DNA 
template, 0.05 units//il of AmpUTaq. The reactions are heated at 95°C for 3 minutes 
and then cycled 35 times using suitable cycling parameters, including, but not Umited 
to, 95°C, 20 seconds, 62°C, 20 seconds, 72X, 3 minutes. In addition to these 

10 conditions, a variety of suitable PGR protocols can be found in PGR Primer, A 
Laboratory Manual, edited by CW. Dieffenbach and G.S. Dveksler, 1995, Gold 
Spring Harbor Laboratory Press; or PGR Protocols: A Guide to Methods and 
A pplications . Michael et al., eds., 1990, Academic Press. 

It is desirable to sequence the DNA ^coding the fusion proteins, or at 

15 least the junction regions of the various portions (APP, transcription factor, linkers) of 
the fusion protein in order to verify that the desired portions have in fact been 
obtained, joined properly, and that no unexpected changes have been introduced into 
the sequences by the PGR reactions. 

Suitable PGR primers for amplification of DNA sequences for use in 

20 the present invention can be readily designed by those of skill in the art. Examples of 
such primers are shown below. 

5'-GGA GAG GAT ATC ATG GAG GCA GTA GAT CG-3' (SEQ ID NO:53) can be 
used to amplify the 5' portion of HIV-1 TAT exon L 

25 

5'-TAC ATG GGG GCC GGC TAG TTA CTG CTT TG-3* (SEQ ID NO:54) can be 
used to an^lify the 3' portion of HIV-1 TAT exon L 

5 ' -GGA TGT GAT ATC TTT CTT CTT GAG CAT GAG C AA GG.3 ' (SEQ ID 
30 NO:55) can be used to amplify the 3' portion of DNA encoding amino acids 1-651 of 
APP, i.e., the transmembrane region of APP. 

The following non-Umiting examples are presented to better illustrate 

the invention. 
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EXAMPIEl 

Transfection of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexoiiI with 
pMM321 

5 The following example demonstrated that an APP/TAT fusion 

construct will transactivate a reporter gene in which the HIVl LTR regulatory DNA 
sequence controls the expression of enhanced green fluorescent protein (EGFP). The 
following also serves as an example of the kind of preliminary routine variations of 
fusion protein levels and inhibitor levels that may be advantageous to test in the 

10 practice of the present invention. Such routine variations are often helpful in 
validating the assays before a large scale screoiing project is undertaken. 

The APP/TAT fusion construct is referred to as "pcDNAS.l zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonr' (see Figure 26) and contains the HEVl 
TAT exon 1 fused just aft^ the transmembrane domain of AFP. This construct is also 

15 shown in outline form in Figure IB. "pMM321" refers to a reporter gene plasmid 
consisting of the HIVl LTR driving the transcription of enhanced green fluorescent 
protein (see Figure 25). As a positive control for TAT expression, a construct in 
which TAT was under the control of a strong, constitutive promoter ( referred to as 
'*pUCd5TAr'; see Figure 24) was used. 

20 

METHODS: 

1. Day 1: Pass HEK 293T cells into 2x6 well dishes at 1 x 105 cells/well. 

2. Day 2: Transfect cells with 9 |jL Fugrae and 0.125 \ig pMM321 and various 
25 amounts of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonL 

Plate 1: 1. 5 |ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

2. 2.5 Mg pcDNAS.l zeo (+) APP(1-651)SW, K612V- 
(MlDTATexonl 

30 3. 1.25 |igpcDNA3.1 zeo (+)APP(1-651)SW, K612V- 

(MlDTATexonl 
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4. 0.625 ng pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

5. 0.312 Jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

5 6. 0.156 ng pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

(MlL)TATexonI 

Plate 2: 1, 0.08 Mg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

2. 0.04 fig pcDNA3.1 zeo (+) APP(1-651)SW. K612V- 
10 (MlL)TATexonI 

3. no pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

4. 0.625 pUCdSTAT 

5. 0.625 ngpcDNA3.1zeo(+)APP(l-651)SW,K612V- 
(MlL)TATexoaI and 1 |ig pMM321 

15 6. l|igpMM321 

Six hours post-transfection, green cells were only observed in plate 2, #5. 

3. Day 3: The fluorescence intaisity of the transfectedceUs was observed and 
20 recorded. 

4. Day 4: The fluorescence intensity of transfected cells was observed and recorded. 
RESULTS: 

25 Co-transfection with pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 
increased GFP expression in the cells. 

Day 3: 

• 5 |ig- no green cells 

30 • 2.5 fig - no green cells (too much DNA for these two transfections?) 

• 1 .25 fig - many bright and dim green cells (see photographs and figure in ancillary 
data) 

• 0.625 fig - bright and dim green cells but fewer than at 1 .25 fig 

• 0.312 fig - no difference obvious between 0.625 fig and 0.312 
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• 0.156 Jig - very few green cells 

• 0.08 Jig - very few green cells 

• 0.04 \ig - very few green cells 

• no pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI - extremely few 
5 (if any) green cells 

• 0.625 Jig pUCd5TAT - cells were extremely bright, not necessarily more in 
number than with pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI 

• 0.625 ^lg pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 1 fig 
pMM321 - many bright and dim green cells. 

10 • 1 (ig pMM321 many-fold fewer green cells, some bright, most dim. 

Day 3 - changed media (saved 1 mL conditioned media from wells Plate 1- 3, 4, 5, 6; 
Plate 2 - 1, 2, 3, 5, 6). Added ftesh media with 10 \M of Lr685,458 (a potent, cell 
permeable y- secretase inhibitor) to wells Plate 1 - 3, 4, 5, 6; Plate 2 - 3, 4, 5, 6. 
15 Waited 48 hours to observe loss of fluorescence since GFP is so stable. 

After 48 hours, all wells appeared brighter than at 24 hour time point. This does not 
necessarily mean that the inhibitor was ineffective, or that the assay did not work, 
since there were no controls run where the inhibitor was not added. However, it does 
20 suggest that under these conditions it may be preferable to add the inhibitor at the tune 
of transf ection to shut down y-secretase as soon as possible and avoid release of TAT 
and induction of GFP. 

BXAMPLB2 

25 Transfection of APP(1-651)SW, K612V-(MlL)TATexonI into HEK293T and H4 
cells accompanied bv inhibition of v-secretase acti vity with L-685,458 

The following example demonstrates the operation of the invention in 
HEK293T cells and H4 cells and shows inhibition of APP processing (and thus TAT 
release) by treatment with a known y-secretase inhibitor. "pcDNA3.1 zeo (+) APP(1- 
30 651)SW, K612V-(MlL)TATexonI," "pMM321," and "pUCd5TAT" are the same as 
in Example 1. H4 cells (ATCC HTB-148) are a neuronal cell line. 
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MEraODS: 

Dayl: Plated out 2 x 6 well plates of HEK293T cells and 2 x 6 well plates of H4 cells 
at lxl05 cells/well. 

5 Day 2: Transfected cells with 2 ng total DNA - pcDNA3.1 zeo (+) APP(1-651)SW, 
K612V-(MlL)TATexonI and carrier (a pET-lN plasmid). 
Platel: 

U: 1 ng pMM321 + 1 \ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI 

10 3,4: 1 ]ig pMM321 + 0.1 ng pcDNAS.l zeo (+) APP(1-651)SW, K612V- 
^lL)TATexonI+ 0.9 (Jg carrier 

S: 1 pMM321 -f 1 MS carrier (added too much carrier to this well in H4 cells) 
6: 1 ng pMM321 +0.1 \ig pUCdSTAT + 0.9 |ig carrier 

15 Plate 2: 

1.2: 0.1 MS pMM321 + 1 M« pcDNAB.l zeo (+) APP(1-651)SW, K612V- 

(MlL)TATexonI + 0.9 \ig carrier 

3.4: 0.1 |Xg pMM321 + 0.1 MS pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI + 
20 1 .8 Mg carrier (added too 0.5X carrier to this mix in 293T cells) 
5: 0.1 MgpMM321 + 1.9 Mg carrier 
6. 0.1 M^g pMM321 + 0.1 Mg pUCdSTAT + 1.8 Mg carrier 

Transfections for HEK293T cells: 9 ML Rigene/well. Combined with DNA in 
25 Optimem and incubated and added to cells according to manufacturer's instructions. 

Transfections for H4 cells: 6 \iL FUgoie/well. Combined with DNA in Optimem and 
incubated and added to cells according to manufacturer's instructions. 

30 Added 10 mM L-685,458 to Plates 1 and 2, wells 2 and 4 for both cell types within 1 
hour of transfection. Observed cells p^odically. 

Took pictures at 24, 46 hours after transfection, using AE lock to keep exposures 
constant between wells. 
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RESULTS: 

Both H4 and 293T cells turned much brighter green in the presence of pcDNAS.l zeo 
(+) APP(l-651)SW,K612V-(MlL)TATexonI 
5 At 24 hours: 
H4 cells: 

Plate 1: lugpMM321 

1. 1 ug pMM321 + 1 ug pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI: Many 

10 bright and dim green cells (good transfection efficiency as well) 

2. 1 ug pMM321 + 1 ug pcDNAS.l zeo (+) APP(1-651)SW, K612V- 
0^1L)TATexonI + 10 pM L-685,458: Also many brigjit and dim green cells, but 
reduced compared with 

well#l 

15 3. Very few green cells (a few per field) 

4. Very few green cells 

5. A few dimly green cells 

6. Some induction with 0.1 \ig pUCdSTAT but still relatively few cells. 

20 Plate 2: 0.1 jig pMM321 

1. 0.1 \ig pMM321 + 1 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI: 

--10 bright green cells/field and the rest are dim green 

2. 0.1 \ig pMM321 + 1 fig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

25 (MlL)TATexonI+ 10fiMLr685,458: -3-5 bright green cells/field, some dim green, 
and some not green. 

3. No green cells 

4. No green cells 

5. No green cells 

30 6. A few bright green cells 

HEK293T cells: 
Plate 1: lfXgpMM321 
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1. + 1 Jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexoiiL of 15 ceUs: 
6 dim, 4 

medium, S bright 

2. + 1 Mg pcDNA3.1 zeo (+) APP(1-^51)SW. K612V-(MlL)TATexonI + 10 pM 
5 685,458: 

of 15 cells: 6 very dim, 5 dim, 1 medium, 3 bright 

3. +0.1 ng pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI: many more 
green 

cells than 1 fig, lots of strong, bright green cells 
10 4. + 0.1 ng pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 |jM 
685,458: 

Fbwer bright green cells/field but intensity does not appear strongly diminished 

5. 1 Mg pMM321 alone: Most cells in the field expressing dim to medium levels of 
GFP 

15 6. enhancement by 0.1 MS pUCdSTAT 

Plate 2: 0.1 \ig pMM321 

1. +lngpcDNA3.1zeo(+)APP(l-651)SW,K612V-(MlL)TATexonL Bright and 
medium 

20 green cells 

2. + 1 fAg pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 10 jiML- 
685,458: 

Bright, medium, and dim green cells 

3. +0.1ngpcDNA3.1zeo(+)APP(l-651)SW.K612V-(MlL)TATexonI: Bright. 
25 medium, 

and dim green cells 

4. + 0.1 \tg pcDNA3.1 zeo (+) APP(1-651)SW. K612V-(MlL)TATexonI + 10 pM L- 
685,458: 

Bright, medium, and dim green cells 
30 5. 0. 1 ng pMM321 alone: Most expressing ceUs have dim GFP, a few medium to 
bright 
cells 

6. Enhancement by 0.1 jxg pUCdSTAT 
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Changed media on cells at 24 houis past transfection. Kept 10 ]M L-68S»4S8 on cells 
in wells 2 and 4. 

At 46 hours after transfection, examined the wells again. Lots of floating cells in all 
5 wells, all cell types. Highest number of floaters in 1 jig pcDNAS.l zeo (+) APP(1- 
651)SW, K612V-(MlL)TATexonI lanes. 

Took some photographs under fluorescent and white light (white light at low 
intensity) to reveal fluorescent and non-fluorescent cells. Conducted a subjective 
10 analysis of the photographs to see if the amount of inhibition by 10 |jM Lr685,458 
was in any way quantifiable. Counted bright (white in the middle), strong (blue 
middle), medium (green) and dim/non-fiuorescent cells and determined the 
approximate fraction of each level of expression. Results follow: 

15 TABLE 1 



293T cells transfected with X |ig pMM321 (first number in left colunm) and X \ig 
pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI (second number in left 
column) 



Transfection 


# Blight 


# Strong 


#Med 


#Non 


% Bright 


%Strong 


%Med 


%Non 


lUg + lMfi 


none 


8 


81 


316 




2 


20 


78 


1 + 1 + cmpd 


none 


5 


66 


521 




0.8 


11 


88 


1 + 0.1 


41 


77 


143 


61 


12 


24 


44 


19 


1+ 0.1+cmpd 


23 


32 


149 


194 











20 Hie results shown in Table 1 mdicate fliat the presence of L-685.458 ("cmpd") caused 
fewer strong and medium fluorescing cells as well as more non-fluorescent cells in the 
first run; in tiie second run, L-685,458 caused fewer bright and stcong fluorescing cells 
as well as more non-fluorescent cells (with slightly more medium fluorescing cells). 
Overall, these data clearly indicate that the presence of an inhibitor of APP processing 

25 such as L-685,458 can be identified by the present invention. 

1 mL of conditioned media from each well was analyzed for production of Ap. 
Higher than background levels of AP were observed in 293T cells after transfection 
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with 1 Jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI and 0.1 \ig 
pcDNA3.1 zeo (+) APP(1-651)SW, K612V.(MlL)TATexonI and higher than 
background levels of AP in H4 cells after transfection with 1 \ig pcDNAS.l zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonI, but not 0.1 jig pcDNA3.1 zeo (+) APP(1- 
5 651)SW, K612V-(MlL)TATexonI (no enhancement of GFP was observed with 0.1 
Jig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI either). AP was 
completely inhibited to background levels by 10 jiM L-685,458. Surprisingly, 
substantial inhibition of GFP was not observed with 10 jiM L-685,458. 

10 100,000 cells from each well were trypsinized and placed in 0. 1 mL phenol red-free 
media in a Costar 96-well dish and read using the fluoromet^. The results are shown 
below: 

TABLE 2 



A 






IttgAPPTAT 


10iiM458 


ClagAPPtat 


10UM458 


no tat 


0.1ug 

pUCd5TAT 










B 


293T 


log 

LTRGFP 


17670 


14321 


65535 


65535 


14890 


65535 










C 




O.lug 
LTRGFP 


9976 


10491 


17677 


14790 


9624 


25735 










D 


H4 


lug 

LTRGFP 


21307 


25307 


7175 


7136 


7147 


7277 










E 




0.1ng 
LTRGFP 


9574 


10031 


7317 


6957 


6946 


7247 










F 




Blank 


7498 


7124 


7570 


7454 


5774 


7638 










G 


























H 




























1 


2 


3 


4 


5 


6 


7 


8 


9 


10 11 


12 



15 In Table 2, "APPtat" is pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI. 
"458" is L-685,458. "LTRGFP" is pMM321. 



-40- 



wo 03/072041 



PCT/US03/05458 



0.1 lig pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexoiiI with and without 
compound exceeded the maximum leading of the fluorometer, as did the addition of 
0.1 fig pUCdSTAT to cells transfected with 1 jig pMM321. 
1 ^g pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI + 1 \xg pMM321 
5 incrementally increased the amount of fluorescence relative to 1 jxg pMM321 alone, 
and this was reduced to background levels by 10 \jM L-685,458. Inhibition of 
fluorescence was also observed in 293T cells transfected with 0.1 |Lig pcDNAS.l zeo 
(+) APP(1-651)SW, K612V-(MlL)TATexonI + 0.1 jig pMM321. No inhibition of 
fluorescence was observed in H4 cells under any transfection conditions. 

10 

EXAMPIJS3 

TlRft of APPf l-65nSW, K612V-TATexonI in H4 cells 

1^875,532 is a known y-secietase inhibitor having the structure shown below. It is 
described and details of its synthesis are disclosed in Seiffert et al., 2000, J. Biol. 
15 Chem. 275:34086-34091, 




1^875532 

Compound X is a P-secretase inhibitor. 

20 

pRBR186 (Figure 22A) is an expression vector containing DNA sequences encoding 
full-length APP containing the Swedish mutation and the K612V mutation. pRBR186 
does not contain a transcription factor fused to the APP sequences. 
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pcDNA3.1 zeo (+) APP(1-651)SW. K612V-(MlL)TATexonI is an expression vector 
that directs tiie expression of a the fusion protein APP(1-651)SW, K612V- 
(MlL)TATexonI in mammalian cells. This fusion protein contains the first 651 
amino acids of APP (with a Swedish version of the P-secretase cleavage site as well 
5 as the K612V mutation) fused in frame to exon I of HIVl TAT, which has been 
modified with a Metl-Leu mutation. A schematic diagram of pcDNA3.1 zeo (+) 
APP(1-651)SW, K612V-(MlL)TATexonI is shown in Figure 26A. The nucleotide 
sequence of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-(MlL)TATexonI (SEQ ID 
NO:22) is shown in Figure 26B-G. 

10 

METHODS: 

1. H4 cells (ATCC HTB-148) were transfected with the various constructs listed 
below using 6 )iL Fugene per 100 |jL Optimem and 100 |iL Optimem per well (6-well 
dishes). Transfection reactions were incubated for 30 minutes prior to adding 100 ]»L 
15 dropwise onto wells. 

Transfections w^ done as follows: 

1. 1 Jig pMM321 (Figure 25A-D) and 1 \Lg pcDNA3.1 backbone 

20 la. 1 Jig pMM321 and 1 H€ pcDNAS.l (kvitrograi, San Diego, CA) backbone. Prior 
to 

transfection, 10 \jM L-875,532 (y-secretase inhibitor) was added to the well. 

2. 1 Jig pMM321 and 1 Jig pRBR186 (Figure 22A; APP expression vector, 
processing 

25 and inhibition of processing control) 

2a. 1 ng pMM321 and 1 jig pRBR186. Prior to transfection, 10 |iM 1^875,532 was 

added to cells 
(transfection solution for 3-5 were prepared in bulk) 

3. and 3a. 1 Hg pMM321 and 1 Mg pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 

30 (MlL)TATexonI 

4. and 4a. Ijig pMM321 and 1 jig pcDNAS.l zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonI. Prior to transfection, 10 jiM 1^875,532 was added to the two 

wells. 

5. and 5a. 1 ^g pMM321 and 1 ]ig pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
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(MlL)TATesxonL Prior to transfection, 10 \>M Compound X was added to the two 
wells 

6. 1 ]ig pMM321 and 1 ng pUCdSTAT (Figure 24). 

7. 1 M-g pMM321 and 1 jig pUCdS TAT. Prior to transfection, 10 jjM 1^875,532 was 
S added to the cells. 

RESULTS: 

Cells were assessed by eye under a fluorescence miax)scope the morning following 
transfection (~20 hrs). 
10 1 and la, 2 and 2a. Weak fluorescence 

3 and 3a. Much strongs fluorescence 

4 and 4a. Clear inhibition of fluorescence 

5 and Sa. Possible inhibition of fluorescence, but doesn't look that ^at 

6 and 7. Almost blindingly fluorescent 

15 

At approximately 48 hours, cells were trypsinized, spun down, and resuspended in 
100 \jL PBS. The cellular contents of each well of the transfection plates were placed 
into one well of a 96-well fluorescence plate. Fluorescence was analyzed using the 
FLUOstar (485 excitation/538 emission). The results are shown in Table 3. 

20 

TABLE 3 



Transient transfections in H4 cells Fluor Units 

pMM321 4484 

pMM321 + 1^875,532 3443 

pMM321 + pRBR186 2735 

pMM321 + pRBR186 + 1^75,532 2161 

pMM321 + APP-TAT-ct32 20177 

pMM321 + APP-TAT-ct32 + 1^875,532 8283 

pMM321 + APP-TAT-ct32 + Compound X 1 1946 

pMM321 + pucd5-TAT 61102 



In Table 3, "APP-TAT-ct32" refers to pcDNA3.1 zeo (+) APP(1-651)SW, K612V- 
(MlL)TATexonL 
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For a graphical presentation of these results, see Figure 19. In Figure 19, XTR-GFP" 
refers to pMM321; "APP-TAT-ct32" refers to pcDNA3.1 zeo (+) APP(1-651)SW, 
K612V-(MlL)TATexonL Compare the bar labeled XTR-GFP + APP-TAT-ct32" 
5 with the bars labeled XTR-GFP + APP-TAT-ct32 + 1^875,532" and *TLTR-GEP + 
APP-TAT-ct32 + Compound X Inhibition by both the p-secretase inhibitor 
(Compound X) and the y-secretase inhibitor (1^875,532) is easily identified by the 
present invention. 

10 Conclusions: 

• The data indicate that the expression of the fusion protein APP(1-651)SW, K612V- 
(MlL)TATexonI enhances transactivation through the LTR of pMM321 in a manner 
that depends on APP processing. 

• APP(1-651)SW, K612V-(MlL)TATexonI expressmg cells were 6X brighter than 
IS pMM321 cells alone. 

• Treatment with L-875,532 decreased fluorescence 2.5X. 

• Treatment with Compound X decreased fluorescence L7X. 

• Expression of TAT via pucd5-TAT was almost blinding and was 19X above 
pMM321 alone, indicating that APP(1-651)SW, K612V-(MlL)TATexonI expression 

20 did not lead to levels of GFP as high as TAT alone. Despite the decreased activation 
shown by TAT when provided by the fusion protein, as compared with TAT driven by 
the AMLP (adenovirus major late promoter) in pucd5-TAT, the assay was easily able 
to identify both the p-secretase and the y-secretase inhibitors. 

• Control plasmids ^MM321 and pMM321 + pRBR186) wa» dimly fluorescent and 
25 were not inhibited by 1^875,532. 

A lower level of inhibition by the P-secretase inhibitor is to be expected since the 
K612V mutation decreases alpha-secretase activity by 95% and thus some alpha- 
secretase cleavage is to be expected. 

30 
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EXAMPIJB4 

Construction of pcDNAS.l (+) zeo APP(1-651)SW, K612V, GAL4-VP16(M1L) APP 
(064-6?? ) 

1. The GAL4-VP16 insert was prepared by PGR from pCR2.1 GAL4VP16 (Figure 
5 20) (Ihvitrogen, San Diego, CA). The PGR was performed to eliminate the N- 

terminal methionine by changing ttiis methionine into a leucine. 

40 ng pCR2.1 GAI>*-VP16 
0.2 ML GAWVP16 5' oUgo at 250 fiM: 
10 5'-CroAGMATCAAGCTACTGTCTrCTATCGAACAAGC-3' (SEQ ID NO:56) 
EcoRV site undedined 
0.2iJLGAI>*VP16 3'oUgo(at250MM): 5'- 

GCGCG^IVaCCCCAaXiTACTraTCAATI^ (SEQ ID NO:57) 
EcoRV site underlined 
15 5 pLlOX Buffer 
8ML25mMMgCl2 
4MLPCRdNTPs 
0.25 mL AmpliTaq Gold 
27.35 mL water 

20 

Cycle: 

Purified reactions using a C^aquick column 
Dieted entire reaction using EcoRV 
25 Ran the DNA on a 1% gel. Excised tiie band and purified using a QiaQuick gel 
purification kit 

2. Digested pcDNA3.1 APP(l-651)/APP(664-695) with EcoRV and SAP treated 
pcDNA3.1 APP(l-651)/APP(664-695) is an intermediate plasmid formed in the 

30 procedure described in Example 6. pcDNAS.l APP(l-651)/APP(664-695) the first 
651 amino acids of APP (with a Swedish vearsion of the P-secretase cleavage site as 
well as the K612V mutation) fused in frame to the last 32 amino acids of APP. 
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3. ligated pcDNAS.l APP(l-€5iyAPP(664-695) -BcoRV digested to GAL4VP16 
(EcoRV digested) 

5 4. At this point, it was realized that the 3' PGR primer for GAL4-VP16 put the 
APP(664-695) fragment out of frame. The APP(664-695) fragment was then re- 
PCR'd using the following protocol: 

1 pLpcDNA3.1 APP(l-651)-Gal4VP16-APP(664-695) 
10 50 nMAPPNotI5'ct32 in frame with OAI>t-VP16 
50nMAPPNoti3' ct32(5' 

(p)CTGCTGT(3SCQQ<2C2eCTAGTTC^^ (SEQ ID NO:58) 

NotI site underlined 

1 nL PC3R dNTPs (10 mM each dNTP, Roche) 
IS 5 |iL lOX Expand Buffer with MgCl2 

1 \>L ]&cpand polymerase (Roche) 

The PGR fragment was run on a 4% agarose gel and gel-purified using a QiaQuick gel 
20 purification colunm 

The fragment was digested with NotI and purified using a QiaQuick PGR purification 
column. 

5. APP(l-651)-Gal4VP16-APP(664-695)wasre-minipTepped. Minipiep #1 was 
25 dieted wilfa NoH, run on a 1% ^1, the upper band was then isolated and SAP- 
treated. 

6. APP(l-651)-Gal4VP16/NotI digested/SAP-treated was ligated to APP(664-695). 
30 7. Mniprqps containing inserts woe sequenced to verify the orientation of the insert 
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EXAMPLES 

Construction of pcDNA3.1 zeo (+) APP(l-651)wt, K612V-TATexonI(MlL) 
APP(664-695) 

This procedure replaced a fragment of APP in pcDNAS.l zeo (+) APP(1-651)SW, 
5 K612V-TATexonI(MlL) APP (664-695) that contained the Swedish mutation with a 
corresponding fragment from pRBR121 containing the wild-type ^-seoetase cleavage 
site rather than the Swedish p-secretase cleavage site. 

1. pcDNAS.l zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
10 (Rgure21B) was digested with SnaBI and then EcoRI 

17 }iL miniprep DNA 
2 ML lOX buffer 
lfiLSnaBI(NEB) 

15 Digest was purified using C^aquick PCR purification kit. Entire digest was then 
cleaved witii EcoRI for 2 hours. 

2. pRBR121 (Figure 21 A) was digested witii SnaBI and then EcoRI 

5ngpRBR121 
20 5 fiLlOX buffer 

2.5 mL SnaBI (NEB) 
q.s. 50 pL with water 

The digest was purified using a C^aquick PCR purification kit The entire digest was 
25 then cleaved with EcoRI for 2 hours. 

3. Botfi digests were run out on a 1% agarose gel. From the pRBR121 lane, the 2.4 
kb SnaBI-EcoRI fragmMit containing tiie wild-type P-secretase cleava^ site was 
isolated. 
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Prom PCDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonKMlL) APP (664-695) 
digest, BOTH the S kb SnaBI-EcoRI backbone firagment AND the 200 bp Ecoia- 
EcoRI fragment weie isolated (see Hguie 21B). 

5 4. Athiee-partligationusingequalmolarratiosof the thiee fragments was earned 
out: 

The assumption was made that, since the starting plasmids were of similar sizes and 
the same amount was digested for each plasmid, the recovered fragments would be in 
approximately equal molar ratios. 

10 

Vector alone: 

ImL APP-TAT-ct32 SnaBl/EcoRI 5Kb ftagment 
7 water 
2 ML 5X buffer 
15 10 )iL 2X buffer OEloche rqnd ligation Mt) 
1 |iLT41igase 

1+1+1 

1 mL pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
20 SnaBl/EcoRI 5KB firagment 

1 nL pRBR121 SnaBI/EcoRI 2.4 Kb insert 

1 ML pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRI/EcoRI 200 bp insert 
5 mL water 
25 2ML5Xbuffer 
10ML2X buffer 
lMLT41igase 

1+1+. . . 1 (in this 3-pt ligation, the ligation of two of the fir^ente together was done 
30 1", thMi the third fragment was added) 

1 ML pcDNAB.l zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
SnaBl/EcoRI 5Kb backbone 

1 mL pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRl/EcoRI 200 bp insert 
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5 water 

2 pL5X buffer 
10 mL2X buffer 
1 |JLLT41igase 

5 waited 5 minutes 

then added l|JiL pRBR121 SnaBIZEcoRI 2,4 kb insert 

1+1+3 

1 MLpcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
10 SnaBIZEcoRI 5 kb backbone 

1 (XL pRBR 121 SnaBIZEcoRI 2.4 H> insert 

3 }jL pcDNAS.l zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 
EcoRIZEcoRI 200 bp insert 

3 water 
15 2 nL5X buffer 
10 nL2X buffer 
lMLT41igase 

Transformed and plated out 200 \iL. The number of colonies in the vector + insert 
20 ligations far exceeded the number of colonies in the vector alone ligation. Picked 12 
colonies from 1+ 1+. . . L 
Picked 6 colonies from 1+3. 
Miniprepped 

Digested with EcoRI to ensure that small 200 bp fragment was incorporated. 
25 RESULTS 

Mmipreps #10 and 15 contained 200 bp EcoRI fragment. 

Oriented with Bam HI digestion. 

30 Sequenced with sAPPb F2 and F3 primers. Miniprep #15 contains both inserts in the 
correct orientation. 
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EXAMPIE6 

Construction of pcDNA3.1 zeo (+) APP(1-651)SW, K612V-TATexonI(MlL) 
APPf664-6951 

1. PGR of APP (664-695): 

5 The starting material was the pRBR186 plasmid (Figure 22A). 

PGR of APP (664-695) 
lngpRBR186 

50 nM 5* oUgo f5'-TGCC(^G CGCGGCX:GCO CX}ATGCTGCCCGQ-3') (SEQ ID 
10 NO:59)NotI site underlined 

50 nM 3' oligo (5*-(p)ATGGTGTC3GCGGCCGCAGACGCCGCTGTCACC-3') 

(SEQ E> NO:60) NotI site underlined 

1 |jL Roche PGR nucleotides 

5 |jL lOX Expand buffer 
15 40 water 

1 pL Expand 

Gycle: (94°C, 5 min) - 25X (94°C. 30 sec; 42°^ 1 min; 72''G, 2 min) - 72°G x 6 min 
-4»Ghold. 

20 

The ~100 bp fragment was gel purified (4% agarose, IX TBE gel) 

The gel-purified fingn^t was ligated into NotI digested, Shrini) Alkaline 
Phosphatase-tieated pcDNA3.1 zeo (+) (Invitto^). The presence of the insot and 
25 its orientation was confibrmed by sequencing. 

2. PCRofAPP(l-651): 
lngpRBR186 

50 nM 5' oUgo (5'-(p)AGGGGACAAGCITCGCCGGGCAGGGTGGCGATGCTG- 
30 3') (SEQ ID NO:61) ffindm site underlined, Met(l) ATG of APP in bold 

50nM3' oligo (5'-(p)GGATGTMGCTTTTTCTTGTTGAGGATCACCAAGG-3') 
(SEQ ID NO:62) Hindm site is underlined 
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1 |iL Roche PGR nucleotides 
5 \jL lOX Expand buffer 
40 \iL water 
1 ML Expand 

5 

Cycle: (94°C, 5 min) - 25X (94°C, 30 sec; 37°C, 1 min; 72°C 2.5 min) - 72^C x 6 
min-4°Chold 

• ITie amplified fragment was isolated on an agarose gel. The fragment was 

10 purified from the gel using Qiaquick Gel purification columns. Ihe fragment was 
digested with HindUL Hie amount of the firagment was too small to subclone, so 
the PGR was repeated using 1 |jiL of the amplified firagment and carrying out S 
reactions simultaneously. 

15 • The fragments were purified from these reactions using a QiaQuick PGR 

purification kit. Tho fragments were eluted in 30 and digested with HmdlH for 
2 hours. The digested fragments were tiien gel purified. 

• The purified fragments were ligated to pcDNA3.1 zeo (+) APP(664-695) fliat had 
20 been digested with Hindm and SAP treated. This gave tiie intermediate plasmid 

pcDNA3.1 zeo (+) APP(l-651)/APP(664-695). 

3. PGR of (MIL) TAT: 

The starting material was NL4-3 viral plasmid OPigure 22B). 

25 

PGRteaction: 

1 ng NL4-3 viral plasmid 

50 nM TAT 5' RV Met-Leu PGR primer (5'- 

TCC AGATATGG TGGAGGCAGTAGATGGTAGAG-3') (SEQ ID NO:63) 
30 EcoRV site underlined, Met-Leu mutation in bold 
50 nM TAT 3 ' RV Met-Leu PGR primer 

(5'-GGTGGATMSCTGTGGTTTGATAGAGAAGG-3' ) (SEQ ID NO:64) 

EcoRV site underlined 

IpLPGRdNTPs 
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5 ML PGR lOX buffer with MgC12 
40 (jL water 

1 |iL Expand polymerase 

5 Cycle: 

94°Cfor5min 

[30 sec 94°C, 1 min 42°C, 1 min 72°C] x 25 cycles 

5niinat72C 

holdat4°C 

10 

• The insert was purified over QiaQuick PGR pxirificationcoliinm 

• The entire reaction was dieted with 30 units EcoRV for 3 hours 

• The ~200 bp insot was gel purified. 

• pcDNA3.1 zeo (+) APP(l-651)/APP(664-695) was digested with EcdRV, and 
IS then SAP treated 

• The Metl-Leu TAT ftagment was Ugated to pcDNAS.l zeo (+) APP(1- 
651)/APP(664-695). 

A map of the resulting plasmid is shown in Figure 22C. 

20 EXAMPLE? 

Design of novel expression vector for expression of APP(1-651)SW, K612V- 
TATexoninVim APP(664-695) 

PURPOSE: To provide a low level expression of APP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695), a prokaryotic selectable marker ttiat is NOT 
25 ampidllin (read-through of the P-lactamase gene is sometimes a problom), and a 
eukaryotic selectable maricer tiuit is NOT zeodn (z»o is the marker for the reporto: 
plasmids in some embodiments). 

METHODS 

30 1. The dEYH> gene was removed from pd2EYFP (Clontech, Palo Alto, CA) using 
BamHI and Notl. The 5' overhangs was filled in using Klenow, and the plasmid was 
re-circularized. 
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pd2EYFP plasmid was digested with BamHI, Notl. 

Ran reaction on 1% agarose gel. Digestion was coirq)lete. Cut out 3.4 kb band. 
Purified using Qiagen Gel Extraction Kit. 
5 Klenow fill-in: 

-4 \ig plasmid backbone 
7.5 ML NEB buffer 2 

33 mM each dNTP (diluted from Roche PGR dNTPs) 
water to 75 jiL 
10 4 |iL Roche Klenow fragmrat (4 units) 

Incubated at room temperature for 15 minutes 
Heat inactivated at 70^C 

15 Took 1 pL fill-in reaction. 
Diluted to 8 pL with water 
Added2 ML 5XDNA buffer 
Added 10 mL 2X Ligation Buffer 
Added 1 pL T4 DNA Ugase 

20 

Incubated at room temperature 

Transformed 2 mL ligation into Invitrogen maximum efficiency DH5alpha competent 
cells. 

25 

Plated out on Kanamycin plates. Lots of colonies. 

2. The RS V promoter from pREP4 Otavitrogen) was excised using BgUI and Hindni 
and cloned into ttie re-drcularized plasmid. 

30 

Digested 5 \ig pREP4 with Hindlll 
Purified using Qiagen PGR purification kit 
Digested with Bgin. 
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The RSV promoter fragment was gel purified and cloned into the le-dicularized 
plasmid. 

3. Hie resulting expression vector (pRSV Kan/Neo res; Figure 23) has die eukaiyotic 
5 RSV promoter 5' to the pd2EYPP polylinker, SV40 driving neo and kanamycin 

prokaiyotic selection, and a pUC ori for high levels of replication in bacteria. 

EXAMPLES 

Use of APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) in HeLa cells with a 
10 B-galactosidase reporter gene 

The following demonstrates the practice of the present invention with 
the APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) fusion protein (SEQ ID 
NO:2) and p-galactosidase as a reporter gene. P4-R5 cells are HeLa cells that contain 
a stably integrated ^-galactosidase reporter gene under the control of the EIVl LTR. 

15 

Materials: 

1. ) Cells: P4-R5 cells 

2. ) DNA: 0.78 ^g/pL pcDNA3.1 zeo (+)APP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695) 

20 3.) Transfection reagents: FUGENE® 

4. ) Media: OPTIMEM® 

cDMEM (-)phenol red /10% FBS 

5. ) Compounds: Compound X(^secretase inhibitor) 10 mM 

L-87S432 (Y-secretase inhibitor) 10 mM 

25 

Protocol: 
Davl 

1.) Cell count on P4-R5 cells = 7.6 x 105 cells pea- ml in cDMEM (-)PR. 
Seeded sterile white luminometer TC plates with the following cell numbers: 
30 10 ml 

5 X 103/weU = 0.75 wl in 9.25 ml media 
1.0 X 104/well = 1.5 ml in 8.5 ml media 
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Seeded 100 fiL cells per well. 
Incubate overnight at 5% CO2. 

Day 2 

2. ) Made up media with appropriate dilutiotis of inhibitors. 

5 On no-inhibitor controls, added 100 \xL of cDMEM with 1% DMSO 

On wells with Compound X, added 10 mM inhibitor in cDMEM 
On wells with 1^875,532, added 10 mM inhibitor in cDMEM 

3. ) Prior to transfection, pulled off media on P4-R5 cells and replaced with media -/+ 
inhibitor. 

10 FUGENE® transfection: 
ForFUGENE® transfection: 

4. ) Added 600 of OPTIMEM® to sterile EPPENDORF® tube and cai^ 

added 30 (xL FUGENE® to media, without touching walls of tube. Incubated at room 
temp^tuie for 5 minutes. 
15 In separate EPPENDORF® tubes, added each DNA. 

Added FUGENE®/OPTIMEM® dropwise to DNA; incubated at room temperature 
for 15 minutes. 

Added 15 (xL /well of DNA/FUGENE®/OPTIMEM® dropwise to media in 
appropriate wells on P4-R5 cells, swirling to mix. 



TABLE 4 



Transfection 
number 


Cone of 
DNA 


Vol of DNA 


Vol of 

FUGENE® 


Vol of sterile 

OPTIMEM® 


APP-ML-Tat- 
APPct 


0.78 figfvL 


5jig= 6.5 ml 


30 ML of 
FUGENE® 


600(iLof 
OPTIMA® 


pUCdSTAT 




5Mg = 4-0ML 


30(jLof 
FUGENE® 


600MLof 
OPTIMEM® 


p243-4 


0.56 ngf\iL 


5Mg=9|jL 


30)jLof 
FUGENE® 


600|jLof 
OFITMEM® 



Li Table 4, "APP-ML-Tat-APPct" refers to pcDNA3.1 zeo (+)APP(1-651)SW, 
25 K612V-TATexonI(MlL) APP(664-695). "pUCdSTAT' is an expression vector that 
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serves as a positive control for TAT expression, since it is a construct in which TAT is 
under the control of a strong, constitutive pronaoter (see Figure 24). "p243-4"is a 
control expression vector that directs the expression of APP. 

5 5.) Plates were transferred to an incubator and incubated for 48 hours to allow 
expression and processing of the proteins. 
Day 4 

6.) The protocol below was followed for lysis of the cells and measurement of P- 
galactosidase in the cell lysates. 

10 

Measurement of ^-galactosidase in lysates of transfected cells. 

1. Removed TROPK® chemiluminescOTce kit(s) from cold 
room, allowed to come to room temperature in a 37®C water bath. 

2. ^galactosidase standards were prepared: 

15 Made 1 :5000 dilution of p-galactosidase stock (1 mg/ml) in lysis buffer. Did 2 fold 
dilutions. 

3. Diluted TROPK® substrate 1:25 into buffer. (Made enough 

for 100 pL /well). 

4. Added to reservoir and added 100 jjL/well. 

20 5. Added 10 \iL of P-galactosidase standards to column 12 on 

plate and incubated in dark for 1 hour. 

6. Read immediately in luminometer using standard file. Riled in 

required fields, read plate. 

25 The results are shown in Figure 33. Figure 33 demonstrates tiiat tfie present invention 
was able to identify both the P-secretase inhibitor (Compound X) and the y-secretase 
inhibitor (1^875,532). In Figure 33, "APP-tat-ct32" refers to pcDNA3.1 zeo 
(+)APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695). Although not indicated 
in Figure 33, the results for the controls were as expected: a large transactivation of 

30 the LTR by pUCdSTAT was observed which was not affected by either inhibitor. No 
transactivation was seen with p243-4. 
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EXAMFLE9 

Comparison of the use of APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) 
and APP(l-651)wt, K612V-TATexonI(MlL) APP(664-695) with a P-galactosidase 
reporter gene — — . 

5 The following shows a side-by-side comparison of the practice of the 

present invention with die APP(1-651)SW, K612V-TATexonI(MlL) APP(664-695) 
fusion protein (SEQ ID N0:2) and the APP(l-651)wt, K612V-TATexonI(MlL) 
APP(664-695) fusion protein (SEQ ID N0:4). P4-R5 cells are HeLa cells that contain 
a stably integrated ^-galactosidase reporter ^e undo: the control of the HIVl LTR. 

10 

Materials: 

1. ) Cells: P4-R5 cells 

2. ) DNA: 0.78 Hfi/pL pcDNA3.1 neo (+) APP(1-651)SW, K612V- 
TATexonI(MlL) APP(664-695) 

15 0.812 M-g/pL pcDNA3.1neo(+)APP(l-651)wt,K612V- 

TATexonI(MlL) APP(664-695) 

1.24 ng/nL pUCd5TAT 
0.56 jAg/nL p243^ 

3. ) Transfection reagents: FUGENE® 
20 4.)Ivfedia: OPTIMEM® 

cDMEM (-)phenol red /10% FBS 
5.) Compounds: Compound X(P-secretase inhibitor) 10 mM 
L-875,532 (Y-seoetase inhibitor) 10 mM 

Dayl 

25 1.) CeU count on P4-R5 ceUs = 5 x 105 cells per ml in cDMEM (-)PR. 

Seeded sterile white luminometer TC plates with the following cell numbers: 

10 ml 

5 X 103/well = 4.0 ml in 36.0 ml media 
Diluted 1:1 into media and seeded one plate at 2.5 x 103/well. 
30 Seeded 100 jiL cells per well. 

Incubated overnight at 37°C, 5% CO2. 

Day 2 
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2. ) Made up media with appropriate dilutions of inhibitors. 

On no-inhibitor controls, added 100 |iL of cDMEM with 1% DMSO 
On wells with Compound X, added titration curve from 100 \jM 
5 inhibitor in cDMEM (-)PR. 

On wells with L-875,532, added titration curve from 100 \jM inhibitor 

incDMEM(-)PR. 

3. ) Prior to transfection, pulled off media on P4-R5 cells and replaced with media -/+ 
10 inhibitor. 

FUGENE® transfection: 

4. ) Added volume of OFIIMBM® to sterile EPPENDORF® tube and carefully 
15 added correct volume of FUGENE® to media, without touching waUs of tube. 

Incubated at room temperature for 5 minutes. 

In separate EPPENDORF® tubes, added each DNA, as outlined below. 
Added FUGENE®/OPTIMEM® dropwise to DNA; incubated at room 
temperature for 15 minutes. 
20 Added 15 ^IVwell of DNA/FUGENE®/OPTIMEM® dropwise to media in 

appropriate wells on P4-R5 cells, swirling to mix. 



TABLES 



Transfection number 


Cone of 
DNA 


Vol of DNA 


Vol of 
FUGENE® 


Vol of sterile 
OFITMEM® 


1.) APP-ML-Tat- 
APPcKSw) 


0.78 )ig/|iL 


10Mg= 13 ML 


60 ML of 
FUGENE® 


1200 mL of 
OPTIMA® 


2.)APP-ML-Tat- 
APPctfWT) 


0.812 
WS/llL 


10|jig=12.2 
ML 


60MLof 
FUGENE® 


1200 mL of 
OFITMEM® 


3.)pUCd5TAT 


1.24^g/ML 


5|Jlg = 4.0|jL 


30 ML of 
FUGENE® 


600 mL of 
OPTIMEM® 


4.)p243-4 




5|Jg=9ML 


30 ML of 
FUGENE® 


600 mL of 
OPTIMEM® 
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In Table 5, "APP^ML-Tat-APPct (Sw)" refeis to pcDNAS.l neo (+) APP(1-651)SW, 
K612V-TATexonI(MlL) APP(664-695). "APP-ML-Tat-APPct (WT)" refers to 
pcDNAS.l neo (+) APP(l-651)wt, K612V-TATexonI(MlL) APP(664-695). 
5 "pUCd5TAT* is an expression vector that serves as a positive control for TAT 
expression, since it is a construct in which TAT is under the control of a strong, 
constitutive promoter (see Figure 24). "p243-4" is a control expression vector that 
directs the expression of APP. 

10 S.) Plates were transferred to an incubator and incubated for 36 hours to allow 
expression and processing of proteins. 

Day 4 

IS 6.) The protocol below was followed for lysis of the cells and nieasurement of 
galactosidase in the cell lysates. 

Measurement of fi-galactosidase in lysates of transfected cells. 

L Removed TROPIX® chemiluminescence kit(s) from cold room, allowed to 
20 come to room temperature in a 37°C water bath. 

2. P-galactosidase standards were prepared: 

Made 1:5000 dilution of ^-galactosidase stock (1 mg/ml) in lysis buffer. Did 2 
fold dilutions. 

3. Diluted TROPIX® substrate 1:25 into buffer. (Made enough for 120 \iL 
25 /weU). 

4. Added to reservoir and added 120 lil/weU. 

5. Added 10 fiL of P-galactosidase standards to colunon 12 on plate and 
incubated in daik for 1 hour. 

6. Read immediately in luininometer using standard file. Filled in required 
30 fields, read plate. 

The results are shown in Figure 34. In Figure 34, "APP(NFEV)HAMycELAG" refers 
to a protein that is a variant of APP in which NFEV is present at the ^-secretase 
cleavage site and there are epitope tags in the amino terminal portion of the protein 
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but there is no transcription factor fused to APP. "APP(Sw)tat-ct32" refers to 
pcDNAS.l neo (+) APP(1.651)SW, K612V.TATexonI(MlL) APP(664-695). 
"APP(WT)tat^t32" refers to pcDNA3.1 neo (+) APP(1.651)wt, K612V. 
TATexonI(MlL) APP(664.695). Figure 34 shows that the Swedish vereion and the 
5 wild-type version of APP appear to work about equally well in the assay. 

EXAMPLE 10 

1^685.458 

Lr685,458 is a y-secretase inhibitor having the following structure: 

10 




1^685,458 contains an hydroxyethylene dipeptide isostere and is 
thought to function as a transition state analog mimic of aspartyl proteases (Shearman 
et al., 2000, Biochemistry 39:8698-8704). 1^685,458 was prepared as follows: 

15 { lS-Benzyl-4R-[l-(lS-carbamoyl-2-phenylethylcarbamoyl)-lS-3- 

methylbutylcarbamoyl]-2R-hydroxy-5-phenylpentyl}cart)aniic acid tert-butyl ester 
(L-685,458) was prepared by the coupling of 2R-benzyl-5S-tert- 
butoxycaAonylamino-4R-(tert-butyldimethylsilanyloxy)-6-phenymexanoicac 
(Evans et al., 1985, J. Org. Chem, 50:4615-4625) with Leu-Phe-NH2 foUowed by 

20 deprotection with tetrabutylammonium fluoride. The synthesis of { lS-benzyl-4R-[l- 
(lS-carbamoyl-2-phenylethylcaibamoyl)-lS-3-methylbutylcarbamoyl]-2S-hy^ 
phenylpentyl}carbamic add tert-butyl ester (L-682,679) has been described previously 
(De Solms et al., 1991, J. Med Chem. 34:2852-2857). { lS-Ben2yl-4R-[l-(lS. 
caibamoyl-2-phenylethylcaibamoyl>lS-3-methylbutylcarbamoyl]-2-oxo-5- 

25 phenylpentyl }carbamic acid tert-butyl ester (L-684,414) was prepared by pyridinium 
dichromate-mediated oxidation of L-682,679. 
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The present invention is not to be limited in scope by the specijBc 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art from 
5 the foregoing description. Such modifications are intended to fall within the scope of 
the appended claims. 

Various publications are cited herein, the disclosures of which aie 
incorporated by refeienoe in their entireties. 
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WHAT IS CLAIMED IS: 



1 . A DNA molecule comprising a nucleotide sequence encoding a 
fusion protein comprising amino adds 589-651 selected from the group consisting of 
5 wild type APP695, the Swedish version of APP695 and the NEEV (SEQ ID NO:40) 
version of APP695 and a transcription factor where the transcription factor is fused in 
frame to the carboxyl terminus of amino acids 589-651. 



2. The DNA molecule of claim 1 whore amino adds 589-651 
10 contain a K612V mutation. 



3. The DNA molecule of claim 1 v/hexe the nucleotide sequence 
further encodes amino aads 664-695 of AFP695 wheiein amino acids 664-695 are 

fused in frame to the caiboxyl tominus of the transcription factor. 

15 

4. The DNA molecule of claim 1 where the transcription factor is 
selected from the group consisting of: HFV-l TAT, Gal4-VP16, the entire Gal4 
protein, LexA-VP16, E12, E47, Twist, PapiUomavirus E2, EBV Zta, BIV TAT, HIV- 
2 TAT, or SIV TAT. 

20 

5. The DNA molecule of claim 3 where the fusion protein is 
selected from the group consisting of: APP(l-651)wt, K612V-TATexonI(MlL) APP 
(664-695) (SEQ ID NO:4); APP(l-651)wt, K612V. Gal4-VP16(M1L) APP (664-695) 
(SEQ ID N0:8); APP(l-651)wt, TATexonI(MlL) APP (664-695) (SEQ ID NO:12); 

25 and APP(l-651)wt, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO:16). 



6. The DNA molecule of claim 3 where the fusion protdn is 
selected from the group consisting of: APP(1-651)SW, K612V-TATexonICMlL) APP 
(664-695) (SEQ ID N0:2); APP(1-651)SW, K612V, Gal4-VP16(M1L) APP (664- 

30 695) (SEQ ID N0:6); APP(1-651)SW, TATexonI(MlL) APP (664-695) (SEQ ID 
NO:10); and APP(1-651)SW, Gal4-VP16(IV[1L) APP (664-695) (SEQ ID N0:14). 

7. The DNA molecule of claim 3 where the fusion protein is 
selected from the group consisting of: APP(1-651)NEEV, K612V-TATexonI(MlL) 
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APP (664-695) (SEQ ID NO:23) and APP(1-651)NEEV, K612V, GAL4-VP16(M1L) 
APP (664-695) (SEQ ID NO:25). 

8. An expression vector comprising the DNA molecule of claim 1. 

5 

9. A eukaryotic cell comprising the DNA molecule of claim 1 . 

10. The cell of claim 9 further comprising a reporter gene where the 
reporter gene is under the control of a regulatory DNA sequence that is capable of 

10 being activated by the transcription factor. 

11. A method of identifying a substance that inhibits APP 
processing comprising: 

(a) providing a recombinant eukaryotic cell which: 

15 (i) expresses a fusion protein comprising amino acids 589- 

651 selected from the group consisting of wild type APP695, the Swedish version of 
APP695 and the NFEV (SEQ ID NO:40) version of APP695 and a transcription factor 
where the transcription factor is fused in frame to the carboxyl terminus of amino 
acids 589-651; and 

20 (ii) comprises a reporter gene operably linked to a 

regulatory DNA sequence which capable of being activated by the transcription factor; 

(b) measuring the level of reporter gene product in the cell in the 
absence of Ifae substance; 

(c) adding the compound to the cell and measuring the level of 
25 reporter gene product in the cell in the presence of the substance; 

where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

30 12. The method of claim 1 1 where amino acids 589-651 contain a 

K612V mutation. 

13. The method of claim 1 1 where the transcription factor is 
selected from the group consisting of: KV-l TAT, Gal4"VP16, the entire Gal4 



-63- 



wo 03/072041 



PCTAJS03/05458 



protein, LexA-VP16, E12, B47, Twist, Papillomavirus E2, EBV Zta, BIV TAT, HIV- 
2 TAT, or SIV TAT. 

14. The method of claim 1 1 where the fusion protein is selected 
5 from the group consisting of: APP(l-651)wt, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID N0:4); APP(l-651)wt, K612V, Gal4-VP16(M1L) APP (664-695) 
(SEQ ID N0:8); APP(l-651)wt, TATexonI(MlL) APP (664-695) (SEQ ID NO: 12); 
and APP(l-651)wt, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO: 16). 

10 15. The method of claim 1 1 where the fusion protein is selected 

from the group consisting of: APP(1-651)SW, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID N0:2); APP(1-651)SW, K612V, Gal4-VP16(M1L) APP (664-695) 
(SEQ ID N0:6); APP(1-651)SW, TATexonlCMlL) APP (664-695) (SEQ ID NO:10); 
and APP(I-651)SW, Gal4-VP16(M1L) APP (664-695) (SEQ ID NO:14). 

15 

16. The method of claim 1 1 where the fusion protein is selected 
from the group consisting of: APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664- 
695) (SEQ ID N0:23) and APP(1-651)NFEV, K612V, Gal4-VPI6(M1L) APP (664- 
695)(SEQIDNO:25). 

20 

17. A method of identifying a substance that inhibits APP 
processing comprising: 

(a) providing a recmbinant eukaryotic cell which: 

(i) expresses a fusion protein comprising an amino acid 
25 sequence from APP that is ciq>able of being cleaved by both P-secvetase and y- 

secretase and a transcription factor whoce the transcription factor is fused in frame to 
the amino add sequence from AEP; and 

(ii) comprises a reporter gene operably linked to a 
regulatory DNA sequence which citable of bemg activated by the transcription factor; 

30 (b) measuring the level of reporter gene product in the cell in ihe 

absence of the substance; 

(c) adding the compound to the ceD and measuring the level of 
reporter gene product in the cell in the presence of the substance; 
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where a decrease in the level of reporter gene product in the presence 
as compared to the absence of the substance indicates that the substance inhibits APP 
processing. 

5 18. The method of claim 17 where the amino acid sequence from 

APP comprises an amino acid sequence selected from the group consisting of 589-651 
of APP695, 589-651 of the Swedish version of APP695, and 589-651 of the NEEV 
version of AHP695. 

10 19. The method of claim 17 where the amino acid sequence from 

APP contains the amino acid sequence NLDA (SEQ ID NO:38) at the p-secietase 
cleavage site instead of the wild-type sequence KMDA (SEQ ID NO:34). 

20. The method of claim 17 wh^ the amino acid sequence from 
15 APP contains the amino acid sequence NFEV (SEQ ID NO:40) at the p-secietase 
cleavage site instead of the wild-type sequence KMDA (SEQ ID NO:34). 
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DNA sequence of APP(1"651)SW. K612V-TATexonl(m L) APP (664-695) 
(SEQ ID NO: 1) 



1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


C(MCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTCG 

s V w V ■ f viva 


CCTGCTGQCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTSTGGCAGA 


CTGAACATGC 


ACATGAATGT 

f V i W W* 1 Vi 1 


CCAGAATGGG 

^^^n 9%mf V 1 t \4«4VS 


151 


AACTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAA6GA 


201 


AGGCATCCTG 


CAGTATT6CC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 

VI I viWrtflViwVlvl 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 

111 VI 1 VIrl 1 ■ \^ 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTC6TTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGm 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTT6 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGT6GA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGAT6AGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCA7TGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGA(^GGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 

■ ViViViW \ 1 ^4ff IVI 


AATGAACATG 


CCCATTTCCA 


951 


A A A f^f^f^ AAA 

GAAAGCCAAA 


GAGAGGCTTG 


A ^^f^f^ A A /^/^ A 

AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


QCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGnCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAA6AA 


AGCCGCTCAG 
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1351 


ATCCGGTrrr 


AGSTTATGAr 


APAPPTPPGT 


GTGATTTATG 




1401 


TCAGTfTrTr 


TrrCTGCTPT 


APAAPGTGPP 


TGPAGTGGPP 


GARfiARATTP 


1451 


AG6ATGAAGT 


TGATGAGCTG 

1 vin J unuw 1 vl 


PTTPJVGAAAG 


AGPJi'AAAPTA 


TTPAGATCAP 


1501 

X«i/ vx 


GTCTTGGCCA 


ACATGATTAG 


TGAAPPAAGG 


ATPJIGTTAPG 


GAAAPGATGP 


1551 


TCTCATGCCA 


TCT7TGACCG 

1 w 1 1 1 vlr\w wvl 


AAACGAAAAC 


CAPCGTGGAG 

wr%w wu 1 vivlnvl 


PTPPTTPPPG 


1601 

xwx 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 

vinvvin 1 v 1 v*\* 


AGCCGTGGPA 
MviwVvi 1 vaviv/M 


TTPTTTTGGG 

1 1 V# 1 1 1 1 Vivivi 


1651 


GCTGACTfTG 


TGCCAGCCAA 


PAPAGAAAAP 


GAAGTTGAGP 

IWAvl 1 1 vlr\ vl w 


PTGTTGATGP 

w 1 U 1 1 vir\ 1 \J\t 


1701 

X / V X 




GrrGAPfGAG 


GAPTGAPPAP 


TPGAPPAGGT 


TPTGGGTTGA 




TAAATAirAA 


GAPGGAGGAG 


ATPTPTGAAG 


TGAATPTAGA 


TGPAGAATTP 


IflOl 


rGAPATCArT 


PARGATATGA 

\,#Mvl\in 1 Ml vlM 


AGTTPATPJ^T 


PAAGTATTGfi 


TRTTPTTTGP 


xO«,lx 


AGAARATfiTC 


GGTTPAAAPA 


AAGGTGPJ\AT 


PATTGGAPTP 


ATGGTRGGPG 

r\ I uU 1 viV3vivvl 


IQOl 


RTGTTRTCAT 


AGPGAPAGTG 


ATPGTCATCA 

M i Vvl 1 1 vr\ 


PPTTGGTGAT 

WW 1 1 vivi 1 vlr\ 1 


GPTGAAGAAfi 

VI w 1 vlrV^vlr\r\V3 




AAAAAGTTTR 


GTAPPGAGPT 


PGGATPPAPT 


AGTPPAGTGT 

Mvl 1 1 vl 1 


GGTRGAATTP 

viva i \j\jr\r\ i i w 


2001 


TGCAGATATC 


CTGGAGCCAG 


TAGATCCTAG 

1 fx^jfy i WW 1 lAvi 


ACTAGAGPPP 

Mw 1 r\vmviwww 


TGGAAGPATP 

1 \jwirv\viwr\ 1 \j 


2051 


CAGGAAGTCA 


GCCTAAAACT 


GCTTGTACCA 

Mw 1 1 vl 1 rAw wn 


ATTGCTATTG 

r\ 1 1 wiw 1 m 1 vl 


TAAAAAGTGT 

1 rvvwivl 1 vl 1 


2101 


TGCmrCATT 


GCCAAGTTTG 


TTTCATGACA 


AAAGCCTTAG 


GCATCTCCTA 


2151 


TGGCAGGAAG 


AAGCGGAGAC 


AGCGACGAAG 


AGCTCATCA6 


AACAGTCAGA 


2201 


CTCATCMGC 


TTCTCTATCA 


AAGCAGAGGA 


TATCCAGCAC 


AGTGGCGGCC 


2251 


GCAGACGCCG 


CTGTCACCCC 


AGAGGAGCGC 


CACCTGTCCA 


AGATGCAGCA 


2301 


GAACGQCTAC 


GAAAATCCAA 


CCTACAAGTT 


CTTTGAGCAG 


ATGCAGAACT 


2351 


AG 
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(SEQ ID NO: 2) 

Amino acid sequence of APP(1-651)SW, K612V-TATexonI(MlL) APP (664-695) 

ml pgl an 1 1 aawtaral evptdgnagl 1 aepql amf cgpl nmhmnvqngkwdsdpsgt ktci dt kegi 1 qycq 

evypelqltnvveanqpvtiqnwckrgrkqckthphf vipyrcl vgef 1 sdal 1 vpdkckfl hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttslatttttttesveevvrvpttaastpd 
avdkyi etpgdenehahf qkakerl eakhrermsq vmreweeaerqaknl pkadkka v1 qhf qekvesi eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kky vraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeelqdevdel 1 qkeqnysddvl anm1 sepr1 sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqvl vf f aedvgsnkgai i gl mvgg vyiatyi vi 1 1 vml kkk kl gt e l gstspvwwns 

6 

a4i -l epvdprl epwkhpgsgpktactncvckkccf hcQvcfmtkal gi svgrkkrrqrrrahansqthqasl skq 

7 8 
ri sstvaa a daavtpeerhl skmqongvenptvkf f eqmqn 
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DNA sequence of APP(l-651)wt. K612V-TATexonl(MlL) APP (664-695) 



(SEQ 


ID NO: 3) 










1 


ATGCTGCCCG 


GrrrGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GaGGAGGTA 


CCCACTGATG 


GTAATGCTGG 


CC7GCTGGCT 


GAACCCCAGA 


101 


TTSCCAT6TT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAA7GT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CA6ATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGrreACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


■nTGTGATTC 


CCTACCGCTG 


351 


CTTAG7TGGT 


GAGTITATAA 


6TGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCA(3TGAG 


AAGAGTACCA 


ACHGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


1TGTGTGTTG 


CCCACTGQCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCQGA 


TGTCTQGTCG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGHCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCHG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCAT6AGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAqCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCAT66TGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


6TGAT7TATG 


AGCGCATGAA 
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1401 TCAGTCTCTC TCCCTGCTCT ACAACGTCCC TGCAGTQGCC GAGGAGAHC 
1451 AGGATGAACT TGATCAGCT6 CTTCAGAAA6 AGCAAAACTA TTCAGATGAC 
1501 GTCnGGCCA ACATGAHAG TGAACCAAGG ATCAGTTACG GAAAC6ATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTG6CA ncmTOGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAAGATGGA TGCAGAAHC 
1801 CGACATGACT CAGGATATGA AGTTCATCAT CAAGTAnGG TGITCTTTGC 
1851 AGAAGATGTG QGITCAAACA AAGGTGCAAT CATTGGACTC ATQGTQG6CG 
1901 GTffmGTCAT AGCGACAGTG ATCGTCATCA CCnGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA 6CCTAAAACT GCTTGTACCA ATTGCTAnG TAAAAAGTGT 
2101 TGCTTTCAn GCCAAGITTG TTTCATGACA AAAGCCHAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCA6ACGCCG CTGTCACCCC AGAQGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 4) 

Amino acid sequence of APP(l-651)wt, K612V-TATexonI(MlL) APP (664-695) 

ml pgl a1 1 1 1 aawtaral evptdgnagi 1 aepq1 amf cgri nmhmnvqngkwdsdpsgtktcl dtkegl 1 qycq 

evypel qi tn weanqpvtl qnwckrgrkqckthphf v1 pyrcl vgef i sdal 1 vpdkckfl hqermdvcet hi h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyl talqavpprprhvf nnil kky vraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si lynvpavaeeiqdevdel 1 qkeqnysddvl anml sepri sygndal 

mpsl tetkttve! 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgi tt rpgsgl tnl ktee1 sev 
2 3 4 5 

fgadaef rhdsgyevhhqyl vf f aedvgsnkgai iglmvggyviatyl vitl vml kk kkl gt e l gstspvwwns 

6 

ad tl epvdprl eDwkhpqsqpktactncvckkccf hcqvcfmtkal gi svqrkkrrarrrahqnsQthqasl skq 

7 8 
r i sstVQaa daavtpeerhl skniQQnqvenptykff eaman 
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DMA sequence of APP(1-651)SW. K612V-GAL4VP16(M1 L) APP (664-695) 
(SEQ ID NO: 5) 



1 


ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGCTA 


CCCACTGATG 


GTAATGCTQG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTCGCAGA 


CTGAACATGC 


ACATCAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


AT'AOOA A OO A 

ATACCAA^ 


201 


AG6CATCCTG 


CAGTATTGCC 


A A O A A f^T'/^TA 

AAGAAGTCTA 


OOOTOA AOTO 

CCCTGAACTG 


O A O A TO A OO A 

CAGATCACCA 


251 


ATGTGGTAGA 


A f^f^t> A A f^t* A A 

AGCCAACCAA 


CCAGTGACCA 


XOO AOA AOXO 

TCCAGAACTG 


OXOOA Ar^f*f^f> 

GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


XXXOXO A XXO 

TTT6T6ATTC 


OOX A f^f^r^f^TT* 

CCTACC6CTG 


351 


CTTAGTTBGT 


AOTTTAXA A 

6A6TTTATAA 


nnrn a x<^p/*/*x 

6T6AT6CCCT 


TCTCGTTCCT 


OAOA AOXOOA 


401 


A ATTT/^TTA/^A 

AATTCTTACA 


CCAGGAGAGG 


A X/^r* A XO III' 

AT6GATGTTT 


OOO A A AOXO A 

6C6AAACTCA 


XOXXO A OXOO 

TCTTCACTBu 


Act 

451 


CACACC6TCG 


CCAAA6A6AC 


AT6CAGTGA6 


A AOAOXAOO A 

AAGA6TACCA 


A OXXOO A TO A 

ACTT6CATGA 


501 


CTACG6CAT6 


TT6CT6CCCT 


/%/*/V^ A AXXr»A 

GCuGAATTGA 


OA AOXXOOOA 

CAA6TTCCGA 


nr*nf^TA o a ox 

GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTQGCT 


OA AOA A AOXn 

GAAGAAAGTG 


AOA AXOXOO A 

ACAATGTGGA 


TTOXO OTO A T 

TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


xox/*xoox/>/* 

T&rCTGGTGG 


f\f>/^f^fs A OO A O 

GGCGGAGCAG 


AOAOAOAOXA 

ACACAGACTA 


651 


TGCAGATGGG 


A fVTf^ A Af>Af^A 

AGTGAAGACA 


A A OXA OX A O A 

AAGTAGTAGA 


AOTAOO A O AO 

AGTAGCAGAG 


O A OO A A O A A O 

GAGGAAGAA6 


701 


TGGCTGAGGT 


GGAAGAAGAA 


r* A Ar*/*r»r» Axo 

GAAGCCGATG 


A TO A OO A OO A 

ATGACGAGGA 


OO AXOAOO AT 

CGATGAGGAT 


751 


GGTGAT6A6G 


TA/>Ar*/>A AOA 

TAGAGGAAGA 


r»ooxo A OO A A 

GGCTGAGGAA 


OOOTAOOA AO 

CCCTACGAAG 


A A OOO A OAOA 

AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


OO A OO A OO A r> 

CCACCACCAC 


O AOO AOO AOA 

CACCACCACA 


O A OTOTOTOO 

GAGTCTGTGG 


ATI 

851 


AAGAGbrGGT 


TCGAGTTCCT 


AOA AO AOO AO 

ACAACAGCAG 


OO A OTA 

CCAGlACCCC 


TO A TOOOO'I 1 

TGATGCCGTr 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAAC6TC 


AAGCAAAGAA 


CHGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTCGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGHCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 
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1301 TAAAGCATTT CGA6CATCTG CGCATGGTGG ATCCCAAGAA AGCC6CTCAG 
1351 ATCCGGTCCC AGGTTTATGAC ACACCTCCGT GTGATTTATG AGCGCATGAA 
1401 TCAGTCTCTC TCCCTQCTCT ACAACGTGCC TGCAGTGGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AQCAAAACTA HCAGATGAC 
1501 GTCnGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGQ 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGT6GAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTGGCA nCTTTTQGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGTTGAGC CTCTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGiTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAATCTAGA TGCAGAATTC 
1801 CGACATGACT CAGGATAT6A AGTTCATCAT CAAGTATTGG TGnCTTTGC 
1851 AGAAGATGTG GGHCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTT6TCAT AGCGACAGTG ATCGTCATCA CCTTQGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CQGATCCACT AGTCCAGTGT QGTGGAATTC 
2001 TGCAGATATC AAGCTACTGr CTTCTATCGA ACAAGCATGC GATATTTQCC 
2051 GACHAAAAA GCTCAAGTGC TCCAAAGAAA AACC6AAGTG CGCCAAGTGT 
2101 CTGAAGAACA ACTGGGAGTG TCGCTACTCT CCCAAAACCA AAAGGTCTCC 
2151 GCTGACTAGG GCACATCTGA CAGAAGTGGA ATCAAGGCTA GAAAGACTGG 
2201 AACAGCTATT TCTACTGATT TTTCCTCGAG AAGACCHGA CATGATTTTG 
2251 AAAATGGATT CTTTACAGGA TATAAAAGCA TTGTTAACAG GATTATmGT 
2301 ACAAGATAAT GTGAATAAAG ATGCCGTCAC AGATAGATTG GCTTCAGTGG 
2351 AGACTGATAT GCCTCTAACA TTGAGACA6C ATAGAATAAG TGCGACATCA 
2401 TCATCGGAAG AGAGTAGTAA CAAAGGTCAA AGACAGHGA CTGTATCGGG 
2451 AAHCCCGGG GATCTGGCCC CCCCGACCGA TGTCAGCCTG GGGGACGAGC 
2501 TCCACTTAGA CGGCGAGGAC GTGGCGATGG CGCATGCCGA CGCGCTAGAC 
2551 GATTTCGATC TGGACATGTT GQGGGACGGG GATTCCCCQG GTCCGGGAH 



FIG.6B 
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2601 TACCCCCCAC GACTCCGCCC CCTAC6GCGC TCTGGATATG GCCGACHCG 
2651 AGTTTGAGCA GATGnTACC GATGCCCTTG GAAHGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATQCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GAT6CAGAAC TAG 



FIG.6C 
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(SEQ ID NO; 6) 

Amino acid sequence of APP(1-651)SW. K612V. GiAL4-VP16(delMl) APP (664-695) 

ml pgl a1 1 1 1 aawtaral evptdgnagi 1 aepqi amf cgri nmhrnnvqngkwdsdpsgtktd dtkegl 1 qycq 

evypel q1 tnvveanqpvtlqnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckfl hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgidkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrri al enyi talqavpprprhvf nml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvrathi rviyermnqsl si 1 ynvpavaeei qdevdel Iqkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqyl vf f aedvgsnkgai iglravggyviatvi vi tl vml kk kklgtclgstspvwwns 

fld4 ^cl 1 ssi eoacdi crl kkl kcskekokcakcl knnwecrvspkt krsol t rahl tevesrl erl eql fll i f predl d 

6 

mi 1 kmdsl odi kal 1 tgl f vadnvnkdavtdrl asvetdmol tl rohri satssseessnkgarol tvsqi pgdl aop 

tdvsl qdel hi dgedvamahadal ddf d1 dml qdqdspqpqf tphdsapygal dmadf ef eamf tdal gi dev 

7 8 
qqdi qhaqoQQ d aavtpeerhl skmqqngvenptvkf f eqmqn 



FIG. 7 
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DNA sequence of APP(l-651)wt. K612V, 6AL4-VP16(deIMl) APP (664-695) 
(SEQ ID NO: 7) 



1 


ATGCTGCCCG 


GTmGGCACT GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG GTAATGCT6G 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGUGGGATT 


CAGATCCATC AGGGACCAAA 


ACCTQCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CACTATTGCC AAGAAGTrCTA 


CCCTGAACre 


CAGATCACCA 


251 


ATGTOGTAGA 


AGCCAACCAA CCAGTGACCA 


TCCAGAACTC 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC CCATCCCCAC 


TTTGIGAnC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGfTITATAA GTGATQCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AAnCTTACA 


CCAGGAGAGG ATGGATGTTT 


GCGAAACTCA 


TCTTCACTCG 


451 


CACACC6TCG 


CCAAAGAGAC ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACG6CATG 


TTGCTGCCCT GCGGAAHGA 


CAAGHCCGA 


GGGGTAGAGT 


551 


TTGTGTGnG 


CCCACTGGCT GAAGAAAGTG 


ACAATGTGGA 


nCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTC6GA TGTCTG6TGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA AAGTAGTAGA 


AGTA6CAGAG 


6AGGAAGAAG 


701 


TGGCTGA6GT 


GGAAGAAGAA GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTCCCA CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGHCCT ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCHG AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG GCAGAACGTC 


AAGCAAAGAA 


CHGCCTAAA 


1051 


GCTGATAAGA 


A6GCAGTTAT CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC TGTTCCTCCT 

FIG.8A 


CGGCCTC6TC 


ACGT6TTCAA 
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1251 


TAT6CTAMG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCC6T 


GTGATTTATG 


AGCGCATGAA 


1401 


TCAGTCraC 


TCCCT6CTCT 


ACAACGTCCC 


TGCA6TGGCC 


GAGGAGAHC 


1451 


AGGATGAAGT 


TGATGAGCTG 


CnCAGAAAG 


AGCAAAACTA 


nCAGATGAC 


1501 


GTCTTGQCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATSCCA 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCIIIIGGG 


1651 


GCTGACTCT6 


TGCCAGCCAA 


CACAGAAAAC 


GAAGTT6A6C 


CTGHGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGQGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAAGATGGA 


TGCAGAATTC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAGTAnGG 


TGTTCrrTGC 


1851 


AGAAGATGfTG 


GGTTCAAACA 


AAQGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTCTTGTCAT 


AGCGACAGTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGrCCAGTGT 


GGTCGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CnCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACHAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCQCTACTCT 


CCCAAAACCA 


AAA6GTCTCC 


2151 


GCTGACTAGG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTAH 


TCTACT6ATT 


TTTCCTCGAG 


AAGACCHGA 


CATGAIIIIG 


2251 


AAAATGGATT 


CTTTACAGGA 


TATAAAAGCA 


TTGTTAACAG 


GATTATTTGT 


2301 


ACAAGATAAT 


GTGAATAAAG 


ATGCCGTCAC 


AGATAGATTG 


GCTTCAGrrGG 


2351 


AGACTGATAT 

nvmv# 1 Wirt 1 rii 


GCCTCTAACA 


TT6AGACAGC 


ATAGAATAA6 


TRCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGHGA 


CTGTATCGGG 


2451 


AATTCCCGQG 


GATCTG6CCC 


CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTQGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GAHTCGATC 


TGGACATGTT 


GGGGGACGGG 


GATTCCCCGG 


GTCCGGGATT 



FIG. SB 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGC6C TCTGGATATG GCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCHG GAATTGACGA GTACGGTGG6 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTSTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GAT6CAGAAC TAG 

FIG.8C 
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(SEQ ID NO: 8) 

Amino acid sequence of APP(l-651)wt, K612V. GAL4-VP16(de1Ml) APP (664-695) 

ml pgl al 1 1 1 aawtaral evptdgnagi 1 aepql amf cgr1 nmhmnvqngkwdsdpsgt ktc1 dtkegl 1 qycq 

evypel qi tnvveanqpvtiqnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckfl hqermdvcethl h 

whtvaketcsekstnl hdygmllpcgidkf rgvefvccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttslatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsq vnreweeaerqaknl pkadkkavl qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrrnvd 

pkkaaqi rsqvmthi rviyermnqsl si lynvpavaeeiqdevdellqkeqnysddvl anmi sepri sygndal 

npsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tn1 kteei sev 
2 3 4 5 

jgndaef rhdsgyevhhqkl vf f aedvgsnkgai i gl tnvg qvviatv i vi tl vml kk kkl gtol gstspvwwns 

adi -kl 1 ssi eoacdi crl kkl kcskekpkcakcl knnwecrvspktkrspl trahl tevesri erl eql f 1 1 i f predl d 

6 

mi 1 kmdsl odi kal 1 tgl f vqdnvnkdavtdrl asvetdmpl tl rohri satssseessnkgorol tvsoi pgdl app 

tdvsl qdel hi dgedvamahadal ddf d1 dm! gdgdspgpgf tphdsap ygal dmadf ef eamf tda1 gi day 

7 8 
qqdiqhGqQQa daavtpeerhlskniQqnqvenptvkffeqniqn 

FiG.9 
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DNA sequence of APP(1-651)SW. TATexonKMl L) APP (664-695) 



(SEQ 


ID NO: 9) 








1 


ATQCTGCCC6 


GTrrOGCACT GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGCTA 


CCCACTGATG 6TAATGCTGG 


CCTGCTGGCr 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGT6GCAGA CTGAACATGC 


ACAT6AATGT 


CCAGAATGGG 


151 


AAGTGQGATT 


CAGATCCATC AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC AAGAAGTrCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA CCAGPGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA GTGATGCCCT 


TCTCGHCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC ATSCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


TTGCTGCCCT GCGGAATTGA 


CAAGnCCGA 


GGGGTAGAGT 


551 


TTGTGTGnG 


CCCACTGGCT GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA TGTCTQGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGllGATGAGG 


TAGAGGAAGA GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCAHGCCA CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AA6A6GTGGT 


TCGAGTTCCT ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC TGGGGATGAG 


AATGAACAT6 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAA6A 


AGGCAGHAT CCAGCATTTC 


CAGGAGAAAG 


TGGAATCnT 


1101 


GGAACAGGAA 


GCAGCCAACG AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC TGTTCCTCCT 

FIG.10A 


CGGCCTCGTC 


ACGTGnCAA 



wo 03/072041 



PCT/US03/05458 



17/76 

1251 TAT6CTAAA6 AA6TAT6TCC GCGCAGAACA GAAGGACAGA CAQCACACCC 
1301 TAAAGCATTT C6AGCATGTG CGCATGGTGG ATCCCAAGAA AGCCGCTCAG 
1351 ATCCGGTCCC AG6TTATGAC ACACCTCCGT ffrGATTTATG AGCGCATGAA 
1401 TCACTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTCGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTrTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCffTSGCA TTCTTTTGOG 
1651 GCTGACTCTG TGCCA6CCAA CACAGAAAAC GAAGHGAGC CTGTTGATGC 
1701 CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACQGAGGAG ATCTCTGAAG TCAATCTAGA TGCAGAATTC 
1801 CGACATGACT CAGGATATCA AGTTCATCAT CAAAAATTGG TGTTCTTTGC 
1851 AGAAGATGTG GGHCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAQCT C6GATCCACT AGTCCAGTGT GGTGGAATTC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTA6AGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA AnGCTAHG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTITG TTTCATGACA AAAGCCHAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGCGACGAAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCCG CTGTCACCCC AGAGGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 



FIG. 1 0B 
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(SEQ ID NO: 10) 

'Amino add sequence of APP(1-651)SW. TATexonl(MlL) APP (664-695) 

ml pgl all 11 aawtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgt ktcl dtkegl 1 qycq 

evypel q1 tnvveanqpvti qnwckrgrkqckt hphf vi py rcl vgef 1 sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgl dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdKyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka v1 qhf qekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi talqavpprprhvf nml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvmthl rviyermnqsl si 1 ynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddlqpwhsfgadsvpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

rfldaef rhdsgyevhhqlsl vf f aedvgsnkgal i gl mvggyylatyi vi tl vml kk kkl gt e l gstspvwwns 

6 

ad tl epvdprl eDWkhpqsqpktactncyckkccfhcQvcf mtkal gi svorkkr ror rrahonsothqasl skq 

7 8 
Pi sst vaa a daa vtpeerhl skmaangvenptykf f eqmqn 

FIG. 11 
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DNA sequence of APP(l-651)wt, TATexonUMl L) -APP (664-695) 



(SEO 


ID NO- 111 

XL/ 1 • JLjl/ 










1 


ATGCTftCCCG 


GTTTGGTArT 


firTrpTRTTf; 




uulaL 1 Ltal:naU 




vIO i UwinlSvl 1 M 




fiTAATfiPTfSft 
UlMMluu iVau 






101 


TTGrrATfiTT 




rTCAAPATRP 


APATCAATCT 




151 


AAGTGGGATT 


PAGATrPATr 


AGGGAfPAAA 


APPTCPATTrJ 
fWAa 1 V3UM 1 1 U 




201 


AGGCATCCTG 


PAGTATTGrr 


AAGAAGTPTA 


PPPTCAAPTfi 


PAfiATPAPPA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 

wwnvi 1 vmwwn 


TCPAGAAPTG 


GTGPAAGPRfi 


301 


GGCCGCAAGC 


AGTGCAAGAC 

r^vi 1 viwTvivinw 


CCATCCCCAC 

w 1 wVA/wrliW 


TTTGTGATTP 


PPTAPPGPTR 


351 


CTTAGTTGGT 


GAGTTTATAA 

vinvi 1 1 1 n 1 rvi 


GTGATGCCCT 

vl 1 virl 1 vlwV/w 1 


TCTPGTTPPT 

1 w 1 wvl i 1 1 


GAPAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 

wwrivivinvinvKi 


ATGGATGTTT 

f> 1 vivin 1 vl 1 1 1 


GPGAAAPTPA 


TPTTPAPTRG 

lull VMw 1 dvl 


451 


CACACCGTCG 


CCAAAGAGAP 


ATGPAGTGAG 


AAGAGTAPPA 


APTTfiPATRA 


501 


CTACGGCATG 


TTGCTGCCCT 

1 1 VIS/ 1 VIOOO 1 


GCGGAATTGA 

vlwvivirvA 1 1 vlr\ 


PAAGTTPPGA 


RGGGTAGAGT 


551 


TTGTGTGTTG 

1 IVIlVlivll Ivl 


CCCACTGGCT 


6AAGAAAGTG 


APJVATGTGGA 
nl«nn 1 vl 1 Uvln 


TTPTCPTCAT 


601 


GCGGAGGAGG 

viwuvinvKinvivi 


ATGACTCGGA 


TGTPTGGTRR 

1 vll u 1 vivl 1 UO 


GGPGGAGPAG 


APAPARAPTA 


651 


TGCAGATGGG 

1 VI unvin 1 vivivi 


AGTGAAGArA 


AAGTAGTAGA 

nrtU 1 Mvl 1 r\V3n 


AGTAGPJVGAG 


r^Af^AAf^AAf^ 


701 


TGGfTGAGRT 

t vIvIV/ 1 vlnvlvi 1 


GGAAGAAGAA 


RAARPPGATC 


ATCAPfiAfiRA 


PRATCAr^fiAT 


751 


GGTGATGAGG 

VIVI 1 viii 1 vmvivi 


TAGAGGAAGA 


GGPTGAGGAA 

uvlLr 1 vlr\viV3r\r\ 


PfPTAPGAAG 


AAf^PPAPARA 


801 


GAGAACCACC 

vinvionvwr^ww 


AGCATTGCCA 


CCACCACCAP 

UwnwWAwwnV/ 


PAPPAPPAPJV 

wM\/UnwwMUr\ 




851 


AAGAGGTGGT 

rVtVinVIVi 1 VIVI 1 


TCGAGTTCCT 

1 wvinvi 1 1 WW 1 


ACAACAGCAG 

nwrviwnviwrwa 


PPAGTAPPPP 


TGATRPPRTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAQGCTTG 




CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGHAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACAT1G6 


1151 


CCA6AGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACffrenCAA 



FIG.12A 
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1251 


TATGCTAAAG 


AA6TATGTCC 


RrRrARAAfA RAAfiRAPAfiA 


PARPAPAPPP 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTfifi ATPrPAARAA 

Vrviwri 1 vivi 1 uvl r\ 1 Uv/VfrVWarW 


ARPPRPTPAfi 


1351 


ATCCGGTCCC 


AGGTTATGAC 

r^vivi 1 m 1 vmw 


ACACCTCCGT GTGATTTATfi 


ARPi^ATRAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 

1 WWV 1 VIW 1 W 1 


ACAACGTGCC TGPAGTGGCP 

rwrmwU i viww i viwnvl ■ Uuww 


RARRARATTP 


1451 


AGGATGAAGT 


TGATGAGCTG 

■ vin 1 vinviw 1 VI 


CTTCAGAAAG AGPAAAAPTA 


TTPARATRAP 

1 1 w\Un 1 vV\v 


1501 


GTCTTGGCCA 


ACATGATTAG 

iiwr^ 1 vin 1 1 nvi 


TGAACCAAGG ATCAGTTAPG 

1 virmowrvivivi i vrivl i i r\\/vl 


GAAAPRATRP 


1551 


TCTCATGCCA 


TCTTTGAffR 


AAAPGAAAAP PAPPRTCRAR 


PTPPTTPPPR 


1601 


TGAATGGAGA 


GTrrAcrrTG 

vl 1 1 \*rVJ\J\^ 1 U 


GAPfiATPTPP ARPPRTCRPA 


TTPTTTTRRR 
MUM 1 ivauu 


1651 




TfirfAfirrAA 


PAPJkRAAAAP (^LL(XVT(^Lnr 


PTRTTRATfiP 


1701 






RAPTRAPPAP TPRAPPARRT 


TPTRRRTTCA 


17R1 
X/ ^x 


PAAATATfAA 


vlnvA3Un\3vlr\vl 


ATPTPTRAAR TRAARATRRA 


TCTAfiAATTP 


1801 


CGACATGACT 


TAGGATATGA 


AGTTPATPAT PAAAAATTRR 

rAvi 1 1 \^r\ 1 l^rt 1 \jr\r\r\rv\ 1 1 Uvl 


TRTTPTTTRP 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGPAAT PATTRRAPTP 


ATRRTRRRPR 


1901 


GTGTTGTCAT 

VI 1 u 1 1 VI 1 vn 1 


AGCGACAGTG 

ft VI w Vvl\/r\VI 1 VI 


ATCGTCATCA CCTTGGTRAT 


RPTRAARAAR 

vlU 1 vlnMvirVAvl 


1951 


AAAAAGCTTG 

nnnrinviw i i vi 


GTACCGAGCT 

VI 1 r^wv^vviviv 1 


CGGATCCACT AGTCPAGTGT 

wvivir^ 1 Vf 1 nvi i wonvi i vi i 


GGTGGAATTP 

vivl 1 vivWA 1 1 w 


2001 

Cot WX 


TGCAGATATC 


CTGGAGCCAG 

V/ 1 VlVirWiwV/riVl 


TAGATCCTAG ACTAGAGCCC 

1 r\vm 1 i nvi nw i nvir\viwV/w 


TGGAAGPATP 

1 vivlnr\V3Ur\ 1 U 


2051 


CAGGAAGTCA 


GCCTAAAACT 


GCTTGTACCA ATTGCTATTG 

viv/ 1 1 vl 1 rt\rfV/fi 1 1 viv* try I 1 vi 


TAAAAAGTGT 

1 rwWAvi 1 Vi I 


2101 


TGCTTTCATT 


GCCAAGTTTG 


TTTCATGACA AAAGCCTTAG 


GCATCTCCTA 


2151 


TGGCAGGAAG 


AAGCGGAGAC 


AGCGACGAAG AGCTCATCAG 


AACAGTCAGA 


2201 


CTCATCAAGC 


TTCTCTATCA 


AAGCAGAGGA TATCCA6CAC 


AGTGGCGGCC 


2251 


GCAGACGCCG 


CTGTCACCCC 


AGAGGAGCGC CACCTGTCCA 


AGATGCAGCA 


2301 


GAACGGCTAC 


GAAAATCCAA 


CCTACAAGH CTTTGAGCAG 


ATGCAGAACT 


2351 


AG 
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(SEQ ID NO: 12) 

Amino acid sequence of APP(l-651)wt. TATexonl(MlL) APP (664-695) 

ml pgl aim aavrtaral evptdgnagl 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tn vveanqpvt i qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethi h 

whtvaketcsekstnl hdygml 1 pcgl dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

a vdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveami ndrrri alenyi talqavpprprhvfnml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsq vmthl rviyermnqsl si lynvpavaeei qdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvda rpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

kmdaef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvq qvviatv i vi tl vml kk kkl gtel gstspvwwns 

6 

adtl epvdpri epwkhpgsqpktactncyckkccf hcqvcf mtkal gi sygrkkrrqrrrahqnsqthqasl skq 

7 8 
ri sstvaoa daaytpeerhl skmaonqvenptykf f eamon 
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DMA sequence of APP(1-651)SW. GAL4VP16(delMet) APP (664-695) 



(SEQ 


ID NO: 13) 










1 


ATGCTCCCCG 


GTTTQQCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCT6GAGGTA 


CCCACTGAT6 


6TAATSCTGG 


CCTCCTGGCT 


GAACCCCAGA 


101 


TTGCCATGTT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCA6AATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGnrSGTAGA 


AGCCAACCAA 


CCAGTCACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGAnC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGITTATAA 


GTCATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCITACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


HGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTQGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCAHGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGH 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AAT6AACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTCCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGsrac 


AGGTTATGAC 


ACACaCCGT 


GTCATTTATG 


AGCGCATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


GAGGAGAHC 


1451 


AGGATGAAGT 


TGATGAGCTG 


CnCAGAAAG 


AGCAAAACTA 


JTCAGATGAC 


1501 


GTCnGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCnCCCG 


1601 


TGAAT6GAGA 


GTTCAGCCTG 


GACGATCTCC 


AGCCGTGGCA 


TTCIIIIGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


GAAGHGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


TCTGG6TTGA 


1751 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAATCTAGA 


TGCAGAAHC 


1801 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAAHGG 


TGrrCTTTGC 


1851 


AGAAGATGTG 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


1901 


GTCrrercAT 


AGCGACAGTTG 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT 


CGGATCCACT 


AGTCCAGfTGT 


GGTCGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT 


CnCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACHAAAAA 


GCTCAAGTGC 


TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG 


TCGCTACTCT 


CCCAAAACCA 


AAAGGTCTCC 


2151 


GCTGACTAQG 


GCACATCTGA 


CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTATT 


TCTACTGATT 


TTTCCTCGAG 


AAGACCTT6A 


CAT6AI 1 1 IG 


2251 


AAAATGGATT 


CTTTACAGGA 


TATAAAAGCA 


nGTTAACAG 


GAnATITGT 


2j01 


A/*AA/^ATA AT 

AwwGATAAT 


PTY* A ATA A AP 

bTuAAIAAAu 


ATPPrPTPAP 


APATAPATTP 

AoAiAuAl lu 


rrTTCAtiTfid 
bLi ILAulbu 


2351 


AGACTGATAT 


GCCTCTAACA 


TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATCGGAAG 


AGAGTAGTAA 


CAAAGGTCAA 


AGACAGTTGA 


CTGTATCGGG 


2451 


AATTCCCGQG 


GATCTGGCCC 


CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACTTAGA 


CGGCGAGGAC 


GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT 


GGGGGACGGG 


GAHCCCCGG 


GTCCGGGATT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTG6ATATG GCCGACHCG 
2651 AGTITGAQCA GATGTTTACC GATGCCCTTG GAATTGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCG6 CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.14C 
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(SEQ ID NO: 14) 

Amino acid sequence of APP(1-651)SW. K612V. GAL4-VP16(delMl) APR (664-695) 

m1 pgl aim aawtaral evptdgnagi 1 aepqi amf cgri nmhmnvqngkwdsdpsgtktci dtkegl 1 qycq 

evypelqi tnvveanqpvtiqnwckrgrkqckthphfvlpyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccp1 aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdky] et pgdenehahfqkakerl eakhrermsqvmreweeaerqaknl pkadkka vi qhf qekvesi eqe 

aanerqql vethmarveaml ndrrrl alenyi talqavpprprhvfnml kkyvraeqkdrqhtl khfehvrmvd 

pkkaaqi rsqvnrthl rvlyermnqsl si lynvpavaeeiqdevdellqkeqnysddvl anmi sepri sygndal 

mpsl tetktt vel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgi tt rpgsgl tni kteei sev 
2 3 4 5 

nldaef rhdsgyevhhqkl vf f aedvgsnkgai i gl mvggyyiatyi vi tl vml kk kkl gtel gstspvwwns 

adi 4cl 1 ssi eoacdi crl kkl kcskekokcakcl knnwecrvspktkrspl trahl tevesrl erl eol f 11 i f predl d 

6 

mi 1 kmdsl adi kal 1 tgl f vqdnvnkdavtdrl asvetdmol tl rohr i sat ssseessnkgqrql tvsqi pqdl aop 

tdvsl qdel hi dgedvamahadal ddf d1 dm1 gdqdspgpqf tphdsapygal dmadf ef eqmf tdal gi dev 

7 8 
qqdi qhsqaaa daaytpeerhl skmqqngyenptvkf f eqmqn 
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DNA sequence of APP(l-651)wt. 6AL4VP16(delMet) APR (664-695) 
(SEQ ID NO: 15) 



1 


ATGCTGCCC6 


GmGGCAa 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACTGATG 


GTAATGCTQG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGaATGTT 


aGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAQGA 


201 


AGGCATCQG 


CAGTAHGCC 


AAGAAGfTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTQGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GASTTTATAA 


GTCA7GCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCTTACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


HGCTGCCCT 


GCGGAAHGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


TTGnarfflTG 


CCCACTGGCT 


GAAGAAAGTO 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGQG 


AGT6AAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCAHGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTGG 


ooi 


AAPAPPT^iPT 

AAuAub 1 ub 1 




APAAPAf^AC 




TCATCPPriTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCHG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


6CTGATAAGA 


AG6CAGTTAT 


CCAGCATTTC 


CA6GAGAAAG 


T6GAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AQCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTG1TCAA 
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1251 TATGCTAAAG AAGTAT6TCC GCGCAGAACA GAAGGACAGA CAQCACACCC 
1301 TAAAGCATTT CGAGCATGTG CGCATGGTGG ATCCCAAGAA AGCCGaCAG 
1351 ATCCGGTCCC AGGTTATSAC ACACCTCCGT GTGAnTATG AGCQCATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTCGCC GAGGAGATTC 
1451 AGGATGAAGT TGATGAGCT6 CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCrreGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTTGACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GHCAGCCTG GACGATCTCC AGCCGfTOGCA TTCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGHGAGC CTGTTGATGC 
1701 CCQCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGGGTTGA 
1751 CAAATATCAA GACGGAGGAG ATCTCTGAAG TGAAGATGGA TGCAGAATTC 
1801 CGACATGACT CAGGATATGA AGHCATCAT CAAAAATTGG TGTTCTrTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CAnGGACTC ATGGTGGGCG 
1901 GTGTTGTCAT AGCGACAGTTG ATC6TCATCA CCrTGGTGAT GCTGAAGAAG 
1951 AAAAAGCTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGrGGAATTC 
2001 TGCAGATATC AAGCTACTGT CTTCTATCGA ACAAGCATGC GATATTTGCC 
2051 GACTTAAAAA GCTCAAGTGC TCCAAAGAAA AACCGAAGTG CGCCAAGTGT 
2101 CTGAAGAACA ACTGGGAGTG TCGCTACTCT CCCAAAACCA AAAGGTCTCC 
2151 GCTGACTAGG GCACATCTGA CAGAAGTCGA ATCAAGGCTA GAAAGACT6G 
2201 AACAGCTATT TCTACTGAH TTTCCTCGA6 AAGACCTTGA CATGATTITG 
2251 AAAATGGAH CTTTACAGGA TATAAAAGCA TTGnAACAG GAnATTTGT 
2301 ACAAGATAAT GTGAATAAAG ATGCCGTCAC AGATAGAHG GCnCAGTGG 
2351 AGACTGATAT GCCTCTAACA HGAGACAGC ATAGAATAAG TGCGACATCA 
2401 TCATCGGAAG AGAGTAGTAA CAAAGGTCAA AGACAGTTGA CTGTATCGGG 
2451 AATTCCCGGG GATCTGGCCC CCCCGACCGA TGTCAGCCTG GGGGACGAGC 
2501 TCCACTTAGA CGGCGAGGAC 6TQGCGATGG CGCATGCCGA CGCGCTAGAC 
2551 GATTTCGATC TGGACATGTT GGQGGACGGG GATTCCCCGG GTCCGGGAH 
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2601 TACCCCCCAC GACTCCGCCC CCTACGGCGC TCTQGATATG QCCGACTTCG 
2651 AGTTTGAGCA GATGTTTACC GATGCCCTTG GAAHGACGA GTACQGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCQCGACGCC GCTCTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 

FIG.16C 
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(SEQ ID NO: 16) 

Amino acid sequence of APP(l-651)wt. K612V, GAL4-VP16(delMl) APP (664-695) 

ml pgl a1 1 1 1 aawtaral evptdgnagl 1 aepqi amf cgri nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi py rcl vgef 1 sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hclygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkwevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttslatttttttesveevvrvpttaastpd 

a vdkyl etpgdenehahf qkakerl eakhrermsqvrareweeaerqaknl pkadkka vi qhf qekvesl eqe 

aanerqql vethmarveami ndrrri al eny i tal qavpprprhvf nral kky vraeqkdrqht 1 khf ehvrmvd 

pkkaaqi rsqvrathi rviyermnqsl si lynvpavaeeiqdevdel 1 qkeqnysddvl anmi sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

jcmdaef rhdsgyevhhqkl vf f aedvgsnkgai 1 gl mvggyyiatyi vi tl vml kk kkl gtel gstspvwwns 

adi -kl 1 ssi eoacdi crl kkl kcskekpkcakcl knnwecrvsoktkrspltrahl tevesrl erl eal f 1 1 i f predl d 

6 

mi 1 kmdsl odi kal 1 tal f vadnvnkdavtdrl asvetdmpl 1 1 rohri satssseessnkgqrql tvsgi pgdl app 

tdvsl qdel hi dgedvamahadal ddf dl dml gdgdspgpqf tphdsaovgal dmadf ef eqmf tdal q1 dev 

7 8 
qqdi qhsaQa a daavtpeerhl skmqqngvenpt vkf f eqmqn 
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(SEQ ID NO: 17) 



6 
12 

18; 
24 
30 
36 
42 
48: 
54 
60 
66 
72 
78 
84 
90 
96 
102 
108 
114 
120 
126 
132 
138 
144 
150 
156 
162 
168 
174 
180 
186 
192 
198 
204 
210 
216 
222 
228 
234 
240 
246 
252 
258 
264 
270 
276 



agtttcctcg gcagcggtag gcgagagcac gcggaggagc gtgcgcgggg gccccgggag 
acggcggcgg tggcggcgcg ggcagagcaa ggacgcggcg gatcccactc gcacagcagc 
gcactcggtg ccccgcgcag ggtcgcgatg ctgcccggtt tggcactgct cctgctggcc 
gcctggacgg ctcgggcgct ggaggtaccc actgatggta atgctggcct gctggctgaa 
ccccagattg ccatgttctg tggcagactg aacatgcaca tgaatgtcca gaatgggaag 
tgggattcag atccatcagg gaccaaaacc tgcattgata ccaaggaagg catcctgcag 
tattgccaag aagtctaccc tgaactgcag atcaccaatg tggtagaagc caaccaacca 
gtgaccatcc agaactggtg caagcggggc cgcaagcagt gcaagaccca tccccacttt 
gtgattccct accgctgctt agttggtgag tttgtaagtg atgcccttct cgttcctgac 
aagtgcaaat tcttacacca ggagaggatg gatgtttgcg aaactcatct tcactggcac 
accgtcgcca aagagacatg cagtgagaag agtaccaact tgcatgacta cggcatgttg 
ctgccctgcg gaattgacaa gttccgaggg gtagagtttg tgtgttgccc actggctgaa 
gaaagtgaca atgtggattc tgctgatgcg gaggaggatg actcggatgt ctggtggggc 
ggagcagaca cagactatgc agatgggagt gaagacaaag tagtagaagt agcagaggag 
gaagaagtgg ctgaggtgga agaagaagaa gccgatgatg acgaggacga tgaggatggt 
gatgaggtag aggaagaggc tgaggaaccc tacgaagaag ccacaga^ag aaccaccagc 
attgccacca ccaccaccac caccacagag tctgtggaag aggtggttcg agttcctaca 
acagcagcca gtacccctga tgccgttgac aagtatctcg agacacctgg ggatgagaat 
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagagaatg 
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcciaaagct 
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 
gaccgaggac tgaccactcg accaggttct gggttgacaa atatcaagac ggaggagatc 
tctgaagtga agatggatgc agaattccga catgactcag gatatgaagt tcatcatcaa 
aaattggtgt tctttgcaga agatgtgggt tcaaacaaag gtgcaatcat tggactcatg 
gtgggcggtg ttgtcatagc gacagtgatc gtcatcacct tggtgatgct gaagaagaaa 
cagtacacat ccattcatca tggtgtggtg gaggttgacg ccgctgtcac cccagaggag 
cgccacctgt ccaagatgca gcagaacggc tacgaaaatc caacctacaa gttctttgag 
cagatgcaga actagacccc cgccacagca gcctctgaag ttggacagca aaaccattgc 
ttcactaccc atcggtgtcc atttatagaa taatgtggga agaaacaaac ccgttttatg 
atttactcat tatcgccttt tgacagctgt gctgtaacac aagtagatgc ctgaacttga 
attaatccac acatcagtaa tgtattctat ctctctttac attttggtct ctatactaca 
ttattaatgg gttttgtgta ctgtaaagaa tttagctgta tcaaactagt gcatgaatag 
attctctcct gattatttat cacatagccc cttagccagt tgtatattat tcttgtggtt 
tgtgacccaa ttaagtccta ctttacatat gctttaagaa tcgatggggg atgcttcatg 
tgaacgtggg agttcagctg cttctcttgc ctaagtattc ctttcctgat cactatgcat 
tttaaagtta aacattttta agtatttcag atgctttaga gagatttttt ttccatgact 
gcattttact gtacagattg ctgcttctgc tatatttgtg atataggaat taagaggata 
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2821 cacacgtttg tttcttcgtg cctgttttat gtgcacacat taggcattga gacttcaagc 
2881 ttttcttttt ttgtccacgt atctttgggt ctttgataaa gaaaagaatc cctgttcatt 
2941 gtaagcactt ttacggggcg ggtggggagg ggtgctctgc tggtcttcaa ttaccaagaa 
3001 ttctccaaaa caattttctg caggatgatt gtacagaatc attgcttatg acatgatcgc 
3061 tttctacact gtattacata aataaattaa ataaaataac cccgggcaag acttttcttt 
3121 gaaggatgac tacagacatt aaataatcga agtaattttg ggtggggaga agaggcagat 
3181 tcaattttct ttaaccagtc tgaagtttca tttatgatac aaaagaagat gaaaatggaa 
3241 gtggcaatat aaggggatga ggaaggcatg cctggacaaa cccttctttt aagatgtgtc 
3301 ttcaatttgt ataaaatggt gttttcatgt aaataaatac attcttggag gage 
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(SEQ ID NO: 18) 

MLP6LALLLLAAWTARALEVPTDGNAGLUEPQIAMFCGRLNMH 

MNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVEANQPVTIQNWCKRGR 

KQCKTHPHI^IPYRCLVGEI^SDALLVPDKCKFIilQERMDVCETHIJIWHTVAKETCSE 

KSTNLHDYGMLLPC6IDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYA 

DGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTrSIATrr 

TnTESVEEVVRVPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQV 

MREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLND 

RRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQ 

IRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISE 

PRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVD 

ARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNK 

GAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGY 

ENPTYKFFEQMQN 
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MINIMAL HIV PROMOTER 
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(SEQ ID NO: 19 AND 20) 

1 ATGffTGAGCA AGGGC6AGGA GCTGHCACC GG6GTGGTGC CCATCCTQGT 

TACCACTCGT TCCCGCTCCT CGACAAGTGG CCCCACCACG 6GTAGGACCA 
51 CGAGCTGGAC GGCGACGTAA ACGGCCACAA GHCAGCGTG TCCGGCGAGG 

GCTCGACCTG CCGCTGCATT TGCCGGTGTT CAAGTC6CAC AGGCC6CTCC 
101 GCGAGGGCGA TGCCACCTAC GGCAAGCTGA CCCTGAAGH CATCTGCACC 

CGCTCCCGCT ACGGTGGATG CCGTTCGACT GGGACTTCAA GTAGACGTGG 
151 ACCGGCAAGC TGCCCGTGCC CTGGCCCACC CTCGTGACCA CCTTCACCTA 

TG6CCGTTCG ACGGGCACGG GACCGGGTGG GAGCACTGGT GGAAGTGGAT 
201 CGGCGTGCAG TGCTTCGCCC GCTACCCCGA CCACATGAAG CAGCACGACT 

GCCGCACGTC ACGAAGCGGG CGATGGGGCT GGTGTACnC GTCGTGCTGA 
251 TCTTCAAGTC CGCCATGCCC GAAGGCTACG TCCAGGAGCG CACCATCTTC 

AGAAGTTCA6 GCGGTACGGG CTTCCGATGC AQGTCCTC6C GTGGTAGAAG 
301 7TCAA6GACG ACGGCAACTA CAAGACCCGC 6CCGAGGTGA AGTTCGAGGG 

AAGHCCTGC TGCCGTTGAT GHCIGGGCG CGGCTCCACT TCAAGCTCCC 
351 CGACACCCTG GTGAACCGCA TCGAGCTGAA GGGCATCGAC TTCAAGGAGG 

GCTGTGGGAC CACTTGGCGT AGCTCGACH CCCGTAGCTG AAGHCCTCC 
401 AC6GCAACAT CCTGGGGCAC AAGCTGGAGT ACAACTACAA CAGCCACAAG 

TGCCGTTGTA G6ACCCCGTG HCGACCTCA TGnGATGTT GTCGGTGnC 
451 GTCTATATCA CCGCCGACAA GCAGAAGAAC GGCATCAAGG TGAACTTCAA 

CAGATATAGT GGCQGCTGH CGTCTTCnG CCGTAGHCC ACTTGAAGTT 
501 GACCC6CCAC AACATCGAQG ACGGCAGCGT GCAGCTCGCC GACCACTACC 

CTGGGCGGTG TTGTAGCTCC TGCCGTCGCA CGTCGAGCGG CTGGTGATGG 
551 AGCAGAACAC CCCCATCGGC GACGGCCCCG TGCTGCTGCC CGACAACCAC 

TCGTCTTGTG GGGGTAGCCG CTGCCGGGGC ACGACGACGG GCTGnGGTG 
601 TACCTGAGCA CCCAGTCCGC CCTGAGCAAA GACCCCAACG AGAAGCGCGA 

ATGGACTCGT GG6TCAGGCG GGACTCGTTT CIGGGGHGC TCTTCGCGCT 
651 TCACATG6TC CTGCTGGAGT TCGTGACCGC CGCCGGGATC ACTCTCGGCA 

AGTGTACCAG GACGACCTCA AGCACTGGCG GCGGCCCTAG TGAGAGCCGT 
701 TGGACGAGCT GOTACAAGTAA CTC6AGTCTA GAGGQCCCGT TTAAACCCGC 

ACCTGCTCGA CATGTTCATT GAGCTCAGAT CTCCCGGGCA AATTTGGGCG 
751 TGATCAGCCT CGACTGTGCC TTCTAGnGC CAGCCATCTG nGTTTGCCC 

ACTAGTCGGA GCTGACACGG AAGATCAACG GTCGGTAGAC AACAAACGGG 
801 CTCCCCCGTG CCTTCCTTGA CCCTGGAA6G T6CCACTCCC ACTGTCCTTT 

GAG6GGGCAC GGAAGGAACT GGGACCHCC ACGGT6AGGG TGACAGGAAA 
851 CCTAATAAAA TGAGGAAATT GCATCGCAH 6TCTGAGTA6 GTGTCAnCT 

GGATTATTTT ACTCCTTTAA CGTAGCGTAA CAGACTCATC CACAGTAAGA 
901 ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG AHGGGAAGA 

TAAGACCCCC CACCCCACCC CGTCCTGTCG TTCCCCCTCC TAACCCHCT 
951 CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGCT TCTGAGGCGG 

GTTATCGTCC GTACGACCCC TACGCCACCC GAGATACCGA AGACTCCGCC 
1001 AAAGAACCAG CATGTTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA 

TTTCTTGGTC GTACACTCGT TTTCCGGTCG TTTTCCGGTC CnGGCATTT 
1051 AAGGCCGCGT TGCTGGCGTT TTTCCATA6G CTCCGCCCCC CTGACGAGCA 

nCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG GACTGCTCGT 
1101 TCAC AAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT 

AGTGTTTTTA GCTGCGAGTT CAGTCTCCAC C6CTTTGGGC TGTCCTGATA 
1151 AAAGATACCA GGCGTITCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 

TTTCTATGGT CCGCAAAGGG GGACCHCGA GGGAGCACGC GAGAGGACAA 
1201 CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CHCGGGAAG 

GGCTSGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG GAAGCCCITC 
1251 CGTGGCGCTT TCTCATAGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 

6CACCGCGAA AGAGTATCGA GTGCGACATC CATAGAGTCA AGCCACATCC 
1301 TCGTTCGCTC CAAGCTG6GC TGTGT6CACG AACCCCCCGT TCAGCCCGAC 

AGCAAGCGAG GHCGACCCG ACACACGTGC TTGGGGGGCA AGTCGGGCTG 
1351 CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA 

CCGACGCGGA ATAGGCCATT GATAGCAGAA CTCAGGTT6G GCCATTCTGT 
1401 CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 

GCTGAATAGC GGTGACCGTC GTC6GTGACC AHGTCCTAA TCGTCTCGCT 
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1451 GGTATGTAGG CGGTGCTACA 6AGTTCTTGA AGTGGTGGCC TAACTACGGC 

CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG ATTGATGCCG 
1501 TACACTAGAA GAACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC 

ATGT6ATCTT CTTGTCATAA ACCATAGACG CGAGACGACT TCGGTCAATG 
1551 CrrCGGAAAA AGAGHGGTA GCTCTTGATC CG6CAAACAA ACCACC6CTG 

GAAGCCTTTT TCTCAACC AT CGAGAACTAG GCCGTrTGTT TGGTGGCGAC 
1601 GTAGCGGTGG IlillllGTr TGCAAGCAGC AGATTACGCG CAGAAAAAAA 

CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC GTCIIIIIII 
1651 GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 

CCTAGAGTTC HCTAGGAAA CTAGAAAAGA TGCCCCAGAC TGCGAGTCAC 
1701 GAACGAAAAC TCACGTTAAG QGATrTTGGT CATGACATTA ACCTATAAAA 

CTTGCTnTG AGTGCAAnC CCTAAAACCA GTACTGTAAT TGGATATTTT 
1751 ATA6GCGTAT CACGAGGCCC TTTCGTCTCG CGCGTTTCGG TGATGACGGT 

TATCCGCATA GTGCTCCGGG AAAGCAGAGC GCGCAAAGCC ACTACTGCCA 
1801 GAAAACCTCT GACACATGCA GCTCCCGGAG ACGGTCACAG CTTGTCTGTA 

CTTTTGGAGA CTGTGTACGT CGAGGGCCTC TGCCAGTGTC GAACAGACAT 
1851 AGCGGATGCC GGGAGCAGAC AAGCCCGTCA GGGC6CGTCA GCGGGTGTTG 

TCGCCTACGG CCCTCGTCTG TTCGGGCAGT CCCGCGCAGT CGCCCACAAC 
1901 GCQGGTGfTCG GGGCTGGCTT AACTATGCGG CATCAGAGCA GAHGTACTG 

CGCCCACAGC CCC6ACCGAA TTGATACGCC GTAGTCTCGT CTAACATGAC 
1951 AGA6TGCACC ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA 

TCTCACGTQG TATACGCCAC ACTTTATOGC GTGTCTACGC ATTCCTCTn 
2001 ATACCGCATC AGGACGCGCC CTGTA6CGGC GCATTAAGCG C6GCGQGTGT 

TATGGCGTAG TCCTGCGCGG GACATCGCCG CGTAAHCGC 6CCGCCCACA 
2051 GGTGGTTACG CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG 

CCACCAATGC GCGTCGCACT GGCGATGTGA ACGGTCGCGG GATCGCGGGC 
2101 CTCCTTTCGC TTTCTTCCCT TCCTTTCTCG CCACGTfCGC C6GCTTTCCC 

GAGGAAAGCG AAAGAAGGGA AGGAAAGAGC GGTGCAAGCG GCCGAAAGGG 
2151 CGTCAAGCTC TAAATCGGGG GCTCCCnTA GGGTTCCGAT nAGIGCITT 

GCAGHCGAG ATTTAGCCCC CGAGGGAAAT CCCAAGGCTA AATCACGAAA 
2201 ACGGCACCTC 6ACCCCAAAA AACTTGATTA GGGTGATGGT TCACGHTAGIG 

TGCCGTCGAG CTGGGGTTTT TT6AACT AAT CCCACTACCA AGTGCATCAC 
2251 GGCCATCGCC CTGATAGACG GTmTCGCC CnTGACGTT GGAGTCCACG 

CCGGTAGCGG GACTATCTGC CAAAAAGCGG GAAACTGCAA CCTCAQGT6C 
2301 TTCTTTAATA GTGGACTCTT GHCCAAACT GGAACAACAC TCAACCCTAT 

AAGAAATTAT CACCTGAGAA CAAGGTTTGA CCTTGTTGTG AGTTGGGATA 
2351 CTCGGTCTAT TCTnTGATT TATAAGGGAT TTTGCCGATT TCGGCCTATT 

GAGCCAGATA AGAAAACTAA ATATTCCCTA AAACGGCTAA AGCCGGATAA 
2401 GG HAAAAAA TGAGCTGATT TAAC AAAAAT TTAACGCGAA TTTTAACAAA 

CCAATTTTTT ACTCGACTAA AnGTnTTA AATTGCGCTT AAAATTGTfT 
2451 ATAHAACGC TTACAATnC CAHCGCCAT TCAGGCTGAA CTAGATCTAG 

TATAAHGCG AATGHAAAG GTAAGCGGTA AGTCCGACTT GATCTAGATC 
2501 AGTCCGHAC ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC 

TCAGGCAATG TATTGAATGC CATHACCGG GCGGACCGAC TGGCGGGTTG 
2551 GACCCCCGCC CATTGACGTC AATAATGACG TATGTTCCCA TAGTAACGCC 

CTGGG6GCGG GTAACTGCAG TTAnACTGC ATACAAGGGT ATCATTGCGG 
2601 AATAGGGACT TTCCATTGAC GTCAATGGGT GGAGTATTTA CGGTAAACTG 

TTATCCCTGA AAGGTAACTG CAGHACCCA CCTCATAAAT GCCATTTGAC 
2651 CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT 

GGGTGAACCG TCATGTAGTT CACATAGTAT ACGGTTCATG CGGGGGATAA 
2701 GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 

CreCAGTTAC TGCCATTTAC CGGGCGGACC GTAATACGGG TCATGTACTG 
2751 CTTATGGGAC TTTCCTACn GGCAGTACAT CTACGTATTA GTCATCGCTA 

GAATACCCTG AAAGGATGAA CCGTCATGTA GATGCATAAT CAGTAGCGAT 
2801 TTACCATGGT GATGCGGITT TGGCAGTACA TCAATGGGCG TGGATAGCGG 

AATGGTACCA CTACGCCAAA ACCGTCATGT AGTTACCCGC ACCTATCGCC 
2851 TTTGACTCAC G6GGATTTCC AAGTCTCCAC CCCATTGACG TCAATGGGAG 

AAACTGAGTG CCCCTAAAGG HCAGAGGTG GGGTAACTGC AGTTACCCTC 
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2901 ITTGTnTGG CAC CAAMT C AACGGGACTT TCCAAAATGT CGTMCAACT 

AAACAAAACC GTGGTnTAG TTGCCCTGAA AGGTITTACA GCATTGTTGA 
2951 CCGCCCCAH GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT 

GGC6GGGTAA CTGCGITTAC CCGCCATCCG CACATGCCAC CCTCCAGATA 
3001 ATAAGCAGAG CTCGITTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC 
^ TATTCGTCTC GAGCAAATCA CTTSGCAGTC TAGCGGACCT CTGCGGTAGG 
3051 ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG 

TGCGACAAAA CTGGAGGTAT CnCTGTGGC CCTGGCTAGG TCGGAGGCGC 
3101 GCCGGGAACG GTGCAnGGA ACQGACCGTG TTGACAAHA ATCATC6GCA 

CGGCCCTTGC CACGTAACCT TGCCTGGCAC AACTGHAAT TAGTAGCCGT 
3151 TAGTATATCG GCATAGTATA ATACGACAAG GTGA6GAACT AAACCATGGC 

ATCATATAGC C6TATCATAT TATGCTGHC CACTCCTTGA TTTGGTACCG 
3201 CAAGTTGACC AGTGCCGTTC CGGTGCTCAC CGCGCGCGAC GTCGCC6GAG 

GTTCAACTGG TCACGGCAAG GCCACGAGTG GCGCGCGCT6 CA6CGGCCTC 
3251 CGGTCGAGH CTGGACCGAC CGGCTCGGGT TCTCCCGGGA CnCGTGGAG 

GCCAGCTCAA GACCTGGCTG 6CCGAGCCCA AGAGGGCCCT GAAGCACCTC 
3301 GACGACnCG CCGGTGTGGT CCGGGACGAC GTGACCCTGT TCATCAGCGC 

CTGCTGAAGC GGCCACACCA GGCCCTGCTG CACTGGGACA AGTAGTCGCG 
3351 GGTCCAGGAC CAGGTGGTGC CGGACAACAC CCTGGCCTGG GTGTGGGTGC 

CCAGGTCCTG GTCCACCAC6 GCCTGTTGTG GGACCGGACC CACACCCACG 
3401 GCGGCCTGGA CGAGCTGTAC GCCGAGTGGT CGGAGGTCGT GTCCACGAAC 

CGCCGGACCT GCTCGACATG CGGCTCACCA GCCTCCAGCA CAGGTGCTTG 
3451 TTCCQGGACG CCTCCG66CC GGCCATGACC GAGATCGGCG AGCAGCCGTG 

AAGGCCCTGC GGAGGCCCGG CCGGTACTGG CTCTAGCCGC TCGTC6GCAC 
3501 GGGGCGGGAG HCGCCCTGC GCGACCCGGC CGGCAACTGC GTGCACnCG 

CCCCGCCCTC AAGCGGGACG CGCTGGGCCG GCCGTTGACG CACGTCAAGC 
3551 TGGCCGAGGA GCAGGACTGA CACTCGACCT CGAAACTTGT TTATTGCAGC 

ACCGGCTCCT CGTCCTGACT GTGAGCTGGA GCTTTGAACA AATAACGTCG 
3601 TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 

A ATATTACCA ATGnTATTT CGHATCGTA GTGTTTAAAG TGTTTATTrC 
3651 CAI 1 1 1 1 1 IC ACTGCAHCT AGTTGTGGn TGTCCAAACT CATCAATGTA 

GTAAAAAAAG TGACGTAAGA TCAACACCAA ACAGGTTTGA GTAGTTACAT 
3701 TCTTATCATG TCTGGATCGA TACTTCAAGA ACTGCTGACA TCGAGCTTGC 

A6AATAGTAC AGACCTAGCT ATGAAGHCT TGACGACTGT AGCTCGAACG 
3751 TACAAGGGAC TTTCCGCTGG GGACTTTCCA GGGAGGCGTG GCCTGGGCGG 

ATGHCCCTG AAAGGC6ACC CCTGAAAGGT CCCTCCGCAC CGGACCCGCC 
3801 GACTGGGGAG TGGCGAGCCC TCAGATCCTG CATATAAGCA GCTGCTTTTT 

CTGACCCCTC ACCGCTCGGG AGTCTAGGAC GTATAHCGT CGACGAAAAA 
3851 GCCT6TACTG GGTCTCTCTG GHAGACCAG ATCTGAGCCT GGGAGCTCTC 

CGGACATGAC CCAGAGAGAC CAATCTGGTC TAGACTCGGA CCCTCGAGAG 
3901 TGGCTAACTA GGGAACCCAC TGCTTAAGCC TCAATAAAGC TTGGTACCGA 

ACC6ATTGAT CCCTTGGGTG ACGAAHCGG AGTTATTTCG AACCATGGCT 
3951 GCTCGGATCC GAATTCGCCA CC 

CGAGCCTAGG CTTAAGCGGT GG 
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(SEQ ID NO: 21 AND 22) 

1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGTTAG 
51 T6CTCTGATG CCGCATAGH AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 

AC6AGACTAC GGCGTATCAA HCGGTCATA GAC6AGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT HAAGCTACA ACAAGGCAAG 

CCTCCAGCGA CTCATCACGC QCTCGmTA AATTCGATGfT TGTTCCGTTC 
151 GCTTGACCGA CAATTGCATG AAGAATCT6C HAGGGITAG GCGTTTTGCG 

CGAACTGGCT GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCnCGCG ATGTACGGGC CAGATATACG CGTTGACATT GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTAA CTAATAACTG 
251 TAGTTAnAA TAGTAATCAA TTACGGGGTC ATTAGnCAT AGCCCATATA 

ATCAATAATT ATCATTAGn AATGCCCCAG TAATCAAGTA TCGGGTATAT 
301 TGGAGTTCCG CGTTACATAA CHACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAAIGTAH GAATGCCAH TACCGGGCGG ACCGACTGGC 
351 CCCAACGACC CCCGCCCAH GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGHGCIGG 6GGCGGGTAA CTGCAGTTAT TACT6CATAC AAGG6TATCA 
401 AACGCCAATA GGGACTTTCC AHGACGTCA ATGG6TGGAC TATTTACGGT 

nGCGGHAT CCCTGAAAQG TAACTGCAGT TACCCACCTG ATAAATGCCA 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATAC6G HCATGCGGG 
501 CCTATTGACG TCAATGACGG TAAATGGCCC 6CCTGGCATT ATGCCCAGTA 

GGATAACTGC AGTTACTGCC ATTTACCGGG CGGACCGTAA TACQGGTCAT 
551 CATGACC1TA TGGGACnTC CTACTTGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTrTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCC6CACCT 
651 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGHCA GAGGTGGGGT AACTGCAGH 
701 TGGGAGITTG TTTrGGCACC AAAATCAACG GGAGTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG TTTTAGITGC CCTGAAAGGT TTTACA6CAT 
751 ACAACTCCGC CCCAHGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTCTATATAA GCAGAGCTCT CT6GCTAACT AGAGAACCCA CTGCTTACTG 

CAGATATATT CGTCTCGAGA GACCGATTGA TCTCnGGGT GACGAATGAC 
851 GCHATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTA6C 

CGAATAGCTT TAATTATGCT GAGTGATATC CCTCIGGGH CGACCGATCG 
901 GTTTAAACTT AAGCHCCCC 6CGCAGGGTC GCGATGCTGC CCGGTTTGGC 

CAAATTTGAA TTCGAAGGGG CGCGTCCCAG CGCTACGACG GGCCAAACCG 
951 ACTGCTCCTG CTGGCCGCCT GGACGGCTCG GGCGCTGGAG GTACCCACTG 

TGACGAGGAC GACCGGCGGA CCTGCCGAGC CCGCGACCTC CATGGGTGAC 
1001 ATGGTAATGC TGGCCTGCTG GCTGAACCCC AGATTGCCAT GTTCTGTGGC 

TACCATTACG ACCGGACGAC CGACTTGGGG TCTAACGGTA CAAGACACCG 
1051 AGACTGAACA TGCACATGAA TGTCCAGAAT GGGAAGTGGG ATTCAGATCC 

TCTGACTTGT ACGTGTACTT ACAGGTCTTA CCCHCACCC TAAGTCTA6G 
1101 ATCAGGGACC AAAACCTGCA TTGATACCAA GGAAGGCATC CTGCAGTATT 

TAGTCCCTGG TnTGGACGT AACTATGGTT CCHCCGTAG GACGTCATAA 
1151 GCCAAGAAGT CTACCCT6AA CTGCAGATCA CCAATGTGGT AGAAGCCAAC 

CGGHCTTCA GATGGGACTT GACGTCTA6T GGHACACCA TCTrCGGHG 
1201 CAACCAGTGA CCATCCAGAA CTGGTGCAAG CGGGGCCGCA AGCAGTGCAA 

GnGGTCACT GGTAGGTCTT GACCACGHC GCCCCGGCGT TCGTCACGIT 
1251 GACCCATCCC CACTTTGTGA HCCCTACCG CTGCHAGIT GGIGAGITTA 

CTGGGTAGGG GTGAAACACT AAGGGATGGC GACGAATCAA CCACTCAAAT 
1301 TAAGTGATGC CCTTCTCGn CCTGACAA6T GCAAAnCTT ACACCAGGA6 

ATTCACTACG GGAAGAGCAA GGACTGTTCA CGTTTAAGAA TGTGGTCCTC 
1351 AGGATGGATG TTTGCGAAAC TCATCTTCAC TGGCACACCG TCGCCAAAGA 

TCCTACCTAC AAACGCTTTG AGTAGAAGTG ACCGTGTGGC AGCGGTTTCT 



FIG.26B 



wo 03/072041 PCT/US03/05458 



48/76 

1401 GACAT6CAGT GAGAAGAGTA CCAACHGCA TGACTACGGC ATGTTGCTGC 

CTGTACGTCA CTCTTCTCAT G6TT6AACGT ACTGATGCCG TACAACGACG 
1451 CCTGC6GAAT TGACAAGTTC CGAGGGGTAG AGTTTGTGTG HGCCCACTG 

GGACGCCHA ACTGHCAAG GCTCCCCATC TCAAACACAC AACGGGTGAC 
1501 6CTGAAGAAA GTGACAATGT GGAHCTGCT GATGCGGAGG AGGATGACTC 

CGACTTCTTT CACTGTTACA CCTAAGACGA CTACGCCTCC TCCTACTGAG 
1551 GGATGTCTGG TGGGGCGGAG CAGACACAGA CTATGCAGAT GGGAGTGAAG 

CCTACAGACC ACCCCGCCTC GTCTGTGTCT GATACGTCTA CCCTCACTTC 
1601 ACAAAGTAGT AGAAGTAGCA GAGGAGGAAG AAGTGGCTGA 6GTGGAAGAA 

TGTTTCATCA TCTTCATCGT CTCCTCCTTC HCACCGACT CCACCTTCTT 
1651 GAAGAAGCCG ATGATGACGA GGACGATGAG GATGGTGATG AGGTAGAGGA 

CncnCGGC TACTACTGCT aiGCTACTC CTACCACTAC TCCATCTCCT 
1701 AGAGGCTGAG GAACCCTACG AAGAAGCCAC AGAGAGAACC ACCAGCAHG 

TCTCCGACTC CTTQGGAT6C TTCTTCGGTG TCTCTCTT6G TGGTCGTAAC 
1751 CCACCACCAC CACCACCACC ACAGAGTCTG TGGAAGAGGT GGTTCGAGn 

GGTQGTGGfTG GTGGTGGTGG TGTCTCAGAC ACCTTCTCCA CCAAGCTCAA 
1801 CCTACAACAG CAGCCA6TAC CCCT6ATGCC GTTGACAAGT ATCTCGAGAC 

GGATGHGTC GTCG6TCATG QGGACTACGG CAACTGHCA TAGAGCTCTG 
1851 ACCTGGGGAT GAGAATGAAC ATGCCCATTT CCAGAAAGCC AAAGAGAGGC 

TQGACCCCTA CTCTTACTTG TACGGGTAAA GGTCTTTCQG TITCTCTCCG 
1901 TTGAGGCCAA GCACCGAGAG AGAATGTCCC AGGTCATGAG AGAATGGGAA 

AACTCCGGH CGTGGCTCTC TCHACAGGG TCCAGTACTC TCnACCCTT 
1951 GAGGCAGAAC GTCAAGCAAA GAACTTGCCT AAAGCTGATA AGAA6GCAGT 

CTCCGTCTTG CAGnCGITT CTTGAACGGA TTTCGACTAT TCHCCGTCA 
2001 TATCCAGCAT TTCCAGGA6A AAGTGGAATC TTTGGAACAG GAAGCAGCCA 

ATAGGTCGTA AAGGTCCTCT nCACCTTAG AAACCTTGTC CTTCGTC6GT 
2051 ACGAGAGACA GCAGCTQGTG GAGACACACA TGGCCAGAGT GGAAGCCATG 

TGCTCTCTGT CGTCGACCAC CTCTGTGTGT ACC6GTCTCA CCTTCGGTAC 
2101 CTCAATGACC GCCGCCGCCT GGCCCTGGAG AACTACATCA CCGCTCTGCA 

GAGTTACTGG CGGCG6C6GA CCGGGACCTC TTGATGTAGT GGCGAGACGT 
2151 GGCTGTTCCT CCTCQGCCTC GTCACGTGTT CAATATGCTA AAGAAGTATG 

CC6ACAAGGA GGAGCCGGAG CAGTGCACAA GTTATACGAT TTCTTCATAC 
2201 TCC6CGCAGA ACAGAAGGAC AGACAGCACA CCCTAAAGCA TTTCGAGCAT 

AG6CGCGTCT TGTCTTCCTG TCTGTC6TGT GGGATTTCGT AAAGCTCGTA 
2251 GTGCGCATGG T6GATCCCAA GAAAGCCGCT CAGATCCGGT CCCAGGHAT 

CACGCGTACC ACCTAGGGTT CTTTCGGCGA GTCTAGGCCA GGGTCCAATA 
2301 GACACACCTC CGTGTGATnr ATGAGCGCAT GAATCAGTCT CTCTCCCTGC 

CTGTGTGGA6 GCACACTAAA TACTCGCGTA OTAGTCAGA GAGAGGGACG 
2351 TCTACAACGT GCCTGCA6TG GCCGAGGAGA HCAGGATGA AGHGATGAG 
• AGATGHGCA CGGACGTCAC CGGCTCCTCT AAGTCCTACT TCAACTACTC 
2401 CTGCnCAGA AAGAGCAAAA CTATTCAGAT GACGTCnGG CCAACATGAT 

GACGAAGTCT TTCTCGTTTT GATAAGTCTA CTGCAGAACC GGHGTACTA 
2451 TAGTGAACCA AGGATCAGTT ACQGAAACGA TGCTCTCATG CCATCTTTGA 

ATCACnGGT TCCTAGTCAA TGCCTTTGCT ACGAGAGTAC GGTAGAAACT 
2501 CCGAAACGAA AACCACCGTG GAGCTCCTTC CCGTGAATGG AGAGTTCAGC 

GGCrrTGCTT TTGGTGGCAC CTCGAGGAAG GGCACTTACC TCTCAAGTCG 
2551 CTGGACGATC TCCAGCCGTG GCAnCTTTT G6GGCTGACT CTGTGCCAGC 

GACCTGCTAG AGGTCGGCAC CGTAAGAAAA CCCCGACTGA GACACGGTCG 
2601 CAACACAGAA AACGAAGTTC AGCCTGTTGA TGCCCGCCCT 6CTGCCGACC 

GTTGTGTCTT TTGCTTCAAC TCGGACAACT AC6GGCGGGA CGACGGCT6G 
2651 GAGGACTGAC CACTCGACCA GGHCIGGGT TGACAAATAT CAAGACGGAG 

CTCCTGACTG GTGAGCTGGT CCAAGACCCA ACTGTTTATA GHCTGCCTC 
2701 GAGATCTCTG AAGTGAATCT AGATGCAGAA TTCCGACATG ACTCAGGATA 

CTCTAGAGAC TTCACTTAGA TCTACGTCTT AAGGCTGTAC T6AGTCCTAT 
2751 TGAAGHCAT CATC AAAAAT TGGTGTTCTT TGCAGAAGAT GTGGGTTCAA 

ACnCAAGTA GTAGTTTTTA ACCACAAGAA ACGTCTTCTA CACCCAAGTT 
2801 ACAAAGGTGC AATCATTGGA CTCAT6GTGG GCGGIGHGT CATAGCGACA 

T6TTTCCACG TTAGTAACCT GAGTACCACC CGCCACAACA GTATCGCTGT 
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2851 GTGATCGTCA TCACCTTGGT 

CACTAGCAGT AGTGGAACCA 
2901 AGTA6ATCCT AGACTAGAGC 

TCATCTAGGA TCTGATCTCG 
2951 CTGCHGTAC CAATTGCTAT 

GACGAACATG GTTAACGATA 
3001 TGTnCATGA CAAAAGCCTT 

ACAAAGTACT GTTrTCGGAA 
3051 ACAGCGACGA AGAGCTCATC 

TGTCGCTGCT TCTCGAGTAG 
3101 CAAAGCAGTA AGTAGGCGGC 

GTUCGTCAT TCATCCGCCG 
3151 GCTGATCAGC CTCGACTGTG 

CGACTAGTCG GAGCTGACAC 
3201 CCCTCCCCCG TGCCTTCCTT 

GGGAGGGGGC ACGGAAGGAA 
3251 TTCCTAATAA AATGAGGAAA 

AAGGATTAH TTACTCCnT 
3301 CTAnCTGGG GGGTGGGGTG 

GATAAGACCC CCCACCCCAC 
3351 GACAATAGCA GGCATGCTGG 

CTGTTATCGT CCGTACGACC 
3401 GGAAAGAACC AGCTGGGGCT 

CCTTTCTTGG TCGACCCCGA 
3451 GCGCATTAAG CGCGGCGGGT 

CGCGTAATTC GCGCCGaCA 
3501 CTTGCCAGCG CCCTAGCGCC 

GAACGGTCGC GGGATCGCGG 
3551 CGCCACGTTC GCCGGCTTTC 

GC6GTGCAAG CGGCCGAAAG 
3601 TAGGGTTCCG ATTTAGTGCT 

ATCCCAAQGC TAAATCACGA 
3651 TAGGGTGATG GTTCACGTAG 

ATCCCACTAC CAAGTGCATC 
3701 CCCTTTGACG TTOGAGTCCA 

GG6AAACTGC AACCTCAGGT 
3751 CTGGAACAAC ACTCAACCCT 

GACCnGTTG TGAGTTGGGA 
3801 ATnTGGGGA TTTCGQCCTA 

TAAAACCCCT AAAGCCGGAT 
3851 ATITAACGCG AATTAATTCT 

TAAATTGCGC TTAATTAAGA 
3901 GTCCCCAGGC TCCCCAGGCA 

CAGGGGTFCCG AGGGGTCCGT 
3951 TAGTCAGCAA CCAGGT6TGG 

ATCAGTCGH 6GTCCACACC 
4001 TATGCAAAGC ATGCATCTCA 

ATACGTTTCG TACGTAGAGT 
4051 CTCCGCCCAT CCCGCCCCTA 

GAGGCGGGTA GGGCGGGGAT 
4101 CATGGCT6AC TAAI 1 1 1 1 1 1 

GTACCGACTG ATTAAAAAAA 
4151 CTCTGAGCTA TTCCAGAAGT 

GAGACTCGAT AAGGTCTTCA 
4201 nTGCAAAAA GCTCCC6GGA 

AAACGTmr CGAGGGCCCT 
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GATGCTGAAG AAGAAAGATA TCATGGAGCC 
CTACGACnC TTCTTTCTAT AGTACCTCGG 
CCTGGAAGCA TCCAGGAAGT CAGCCTAAAA 
GGACC7TCGT AGGTCCTTCA GTCGGATTTT 
TG TAAAAAG T GnGCTTTCA TTGCCAAGH 
ACATTTTTCA CAACGAAAGT AACGGTTCAA 
AGGCATCTCC TATGGCAGGA AGAAGCGGAG 
TCCGTAGAGG ATACCGTCCT TCTTCGCCTC 
AGAACAGTCA GACTCATCAA GCTTCTCTAT 
TCTTGTCAGT CTGAGTAGTT CGAA6AGATA 
CGCTCGAGTC TAGAGGGCCC GTTTAAACCC 
GCGAGCTCAG ATCTCCCGGG CAAATTTGGG 
CCnCTAGTT GCCAGCCATC TGnGTTTGC 
GGAAGATCAA CGGTCGGTAG ACAACAAACG 
GACCCTGGAA GGTGCCACTC CCACTGTCCT 
CTG6GACCTT CCACGGTGAG GGTGACAGGA 
HGCATCGCA HGTCTGAGT AGGIGTCAH 
AACGTAGCGT AACAGACTCA TCCACAGTAA 
GGGCAGGACA GCAAGGGGGA GGATTGGGAA 
CCCGTCCTGT CGTTCCCCCT CCTAACCCn 
GGATGCGGTG GGCTCTATGG CTTCTGAGGC 
CCTACGCCAC CCGAGATACC GAAGACTCCG 
CTA6GGGGTA TCCCCACGCG CCCTGTAGCG 
GATCCCCCAT AGGGGT6CGC GGGACATCGC 
GTGGTGGTTA CGCGCAGCGT GACCGCTACA 
CACCACCAAT GCGCGTCGCA CTGGCGATGT 
CGCTCCTTTC GCTTTCnCC CnCCTTTCT 
GCGAGGAAAG CGAAAGAAGG GAAGGAAAGA 
CCCGTCAAGC TCTAAATCGG GGCATCCC7T 
GGGCAGTTCG AGATTTAGCC CCGTAGGGAA 
TTACGGCACC TCGACCCCAA AAAACHGAT 
AATGCCGTGG AGCtOGGGTT TTT TGAAC TA 
TG6GCCATCG CCCTGATAGA CGGTmTCG 
ACCCQGTAGC GGGACTATCT GCCAAAAAGC 
CGnCTTTAA TAGTG6ACTC nGTTCCAAA 
GCAAGAAAH ATCACCTGAG AACAAGGTTT 
ATCTCGGTCT ATrCTTiTGA TTTATAAGGG 
TAGAGCCAGA TAAGAAAACT AAATATTCCC 
TTGGTTAAAA AATGAGCTGA TTTAACAAAA 
AACCAATm TTACTCGACT AAAnGTm 
GTGGAATGTG TGTCAGTTAG GGTGTGGAAA 
CACCTTACAC ACAGTCAATC CCACACCTTT 
GGCAGAAGTA TGCAAAGCAT GCATCTCAAT 
CCGTCTTCAT ACGTTTCGTA CGTAGAGTTA 
AAAGTCCCCA GGCTCCCCAG CAGGCA6AAG 
TTTCAGGGGT CCGAGGGGTC GTCCGTCTTC 
AHAGTCAGC AACCATAGTC CCGCCCCTAA 
TAATCAGTCG TTGGTATCAG GGCGGGGATT 
ACTCCGCCCA GHCCGCCCA TTCTCCGCCC 
TGAGGCGGGT CAAGGCGGGT AAGAGGCGGG 
TATTTATGCA GAGGCCGAGG CCGCCTCTGC 
ATAAATACGT CT CCGGCT CC GGCGGAGACG 
AGTGAGGA6G CIIIIIIGGA GGCCTAGGCT 
TCACTCCTCC GAAAAAACCT CCGGATCCGA 
GCTTGTATAT CCATTTTCGG ATCTGATCAG 
CGAACATATA GGTAAAAGCC TA6ACTAGTC 
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4251 CACGTGTTGA CAAHAATCA TC6GCATA6T ATATCGGCAT AGTATAATAC 

GTGCACAACT GTTAATTAGT AGCCGTATCA TATAGCCGTA TCATAHATG 
4301 GACAAGGTCA GGAACTAAAC CATGGCCAAG TTGACCAGTG CCGmCCGGT 

CTGTTCCACT CCnGATTTG GTACCGGTTC AACT6GTCAC 6GCAAGGCCA 
4351 GCTCACCGCG CGCGACGTCG CCGGAGCGGT CGAGHCIGG ACCGACCGGC 

CGAGTQGCGC GCGCTGCAGC GGCCTCGCCA GCTCAAGACC TGGCTGGCCG 
4401 TCGGGTTCTC CCGGGACHC GTGGAGGACG ACTTCGCCGG TGTGGTCC6G 

AGCCCAAGAG GGCCCTGAAG CACCTCCTGC TGAAGCGGCC ACACCAGGCC 
4451 GACGACGTGA CCCTGTTCAT CAGCQC6GTC CAGGACCAGG TGGTGCCGGA 

CTGCTGCACT GGGACAAGTA GTCGCGCCAG GTCCTGGTCC ACCACGGCCT 
4501 CAACACCCTG GCCTGGGTGT GGGTGCGCGG CCTGGACGAG CTGTACGCCG 

GHGTGGGAC CGGACCCACA CCCACGCGCC GGACCTGCTC GACATGCGGC 
4551 AGTGGTCGGA Q6TCGTGTCC ACGAACTTCC GGGACGCCTC CGGGCCGGCC 

TCACCAGCCT CCAGCACAGG TGCHGAAGG CCCTGCGGAG GCCCGGCCGG 
4601 ATGACCGAGA TCGGCGAGCA GCCGTGGGGG CGGGAGHCG CCCTGCGC6A 

TACTGGCTCT AGCCGCTC6T CGGCACCCCC GCCCTCAA6C GGGACGCGCT 
4651 CCCGGCCGGC AACTGCGTGC ACnCGTGGC CGAGGA6CAG GACTGACACG 

GGGCCGGCCG TTGAC6CACG TGAAGCACCG GCTCCTCGTC CTGACTGTGC 
4701 TGCTACGAGA TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC 

ACGATGCTCT AAAGCTAAGG TGGCGGCGGA AGATACTTTC CAACCC6AAG 
4751 GGAATCGTTT TCCGGGACGC CGGCTGGATG ATCCTCCAGC GCG6GGATCT 

CCTTAGCAAA AGGCCCTGCG GCCGACCTAC TAGGAGGTCG CGCCCCTAGA 
4801 CATGCTGGAG TTCTTCGCCC ACCCCAACH GTTTAnGCA GCTTATAATG 

GTACGACCTC AA6AAGCGGG TGGG6TTGAA CAAATAACGT CGA ATATTAC 
4851 GHACAAATA AAGCAATAGC ATCACAAAH TCACAAATAA AGCATTTTTT 

CAATGITTAT TTCGTTATCG TAGTGTTTAA AGTGTTTAn TCGTAAAAAA 
4901 TCACTGCATT CTAGT7GTGG TTTGrCCAAA CTCATCAATG TATCTTATCA 

AGTGACGTAA GATCAACACC AAACAGGTIT GAGTAGTTAC ATAGAATAGT 
4951 TGTCTGTATA CCGTC6ACCT CTAGCTAGAG CTT6GCGTAA TCATGGTCAT 

ACAGACATAT G6CAGCTGGA GATCGATCTC 6AACCGCATT AGTACCAGTA 
5001 AGCTGTTTCC TGTGTGAAAT TG7TATCCGC TCACAATTCC ACACAACATA 

TCGACAAAGG ACACACTTTA ACAATAGGCG AGTGHAAGG TGTGTTGTAT 
5051 CGAGCCGGAA GCATAAAGTG TAAAGCCT6G GGT6CCTAAT GAGTGAGCTA 

GCTCGGCCn CGTATTTCAC ATTTCGGACC CCACGGATTA CTCACTCGAT 
5101 ACTCACATTA AnGCGTTGC 6CTCACTGCC CGCTTTCCAG TCGGGAAACC 

TGAGTGTAAT TAACGCAACG CGAGTGACGG GCGAAAGGTC AGCCCTTTGG 
5151 TGTCGTGCCA GCTGCATTAA TGAATCGGCC AACGCGCG6G 6AGAGGCGGT 

ACAGCACGGT CGACGTAAH AC7TAGCCGG HGCGCGCCC CTCTCCGCCA 
5201 TTGCGTAnG GGCGCTCHC CGCTTCCTCG CTCACTGACT CGCTGCGCTC 

AACGCATAAC CCGCGAGAA6 GCGAAGGAGC 6AGTGACTGA GCGACGCGAG 
5251 GGTCGTTCGG CTGCGGCGAG CGGTATCAGC TCACTCAAAG GCGGTAATAC 

CCAGCAAGCC GACGCCGCTC GCCATAGTCG AGTGAG7TTC CGCCAHATG 
5301 GGTTATCCAC AGAATCAGGG GATAAC6CAG GAAAGAACAT GTGAGCAAAA 

CCAATAQGTG TCTTAGTCCC CTAHGCGTC CTTTCnGTA CACTCGTnT 
5351 GGCCAGCAAA A6GCCAGGAA CCG TAAAAAG GCCGCGTT6C TGGCGTTTTT 

CCGGTCGTTT TCCGGTCCH GGCATTTTTC CGGCGCAACG ACC6CAAAAA 
5401 CCATAGGCTC CGCCCCCCTG ACGAGCATCA C AAAAAT CGA CGCTCAA6TC 

G6TATCCGAG GCGGGG6GAC TGCTCGTAGT GTmTAGCT GCGAGTTCAG 
5451 AGAG6TGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCCT 

TCTCCACCGC TTrQGGCTGT CCTGATATTT CTATGGTCCG CAAAGGGGGA 
5501 GGAAGCTCCC TCGTGCGCTC TCCTEnTCCG ACCCTGCCGC TTACCGGATA 

CCnCGAGGG AGCACGCGAG AGGACAAGGC TGGGACGGC6 AATGGCCTAT 
5551 CCTGTCCGCC TTTCTCCCn CGGGAAGCGT GGCGCTTTCT CAATGCTCAC 

GGACAGGCGG AAAGAGGGAA GCCCTTCGCA CCGCGAAAGA GTTACGAGTG 
5601 GCTGTAGGTA TCTCAGHCG GTGTAGGTCG HCGCTCCAA GCTGGGCTGT 

CGACATCCAT AGAGTCAAGC CACATCCAGC AAGCGAGGTT CGACCCGACA 
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5651 GTGCACGAAC CCCCCGTTCA GCCCGACCGC TQCGCCTTAT CCGGTAACTA 

CACGTGCTTG GGGGGCAAGT CGGGCTGGCG AC6CGGAATA QGCCATTGAT 
5701 TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG 

AGCAGAACTC AGGnGGGCC AHCTGIGCT GAATAGCGGT 6ACCGTCGTC 
5751 CCACTGGTAA CAGGATTAGC AGAGC6AGGT AT6TAGGCGG TGCTACA6AG 

GGTGACCATT GTCCTAATCG TCTCGCTCCA TACATCC6CC ACGATGTCTC 
5801 nCHGAAGT GGTCGCCTAA CTACGGCTAC ACTAGAAGGA CAGTATnGG 

AAGAACnCA CCACCGGATT GATGCCGATG TGATCTTCCT GTCATAAACC 
5851 TATCTGCGCT CTGCT6AAGC CAGTTACCn CG GAAAAA GA GTTG6TAGCT 

ATAGACGCGA 6ACGACTTCG GTCAATGGAA GCCTTTTTCT CAACCATCGA 
5901 CTTGATCCQG CAAACAAACC ACC6CTGGTA GCGGTGGTTT TTTTGTTTGC 

GAACTAGGCC GTTTGTTTGG TGGCGACCAT CGCCACCAAA AAAACAAACG 
5951 AAGCAGCAGA HACGCGCAG AAAAAAAG GA TCTGAAGAAG ATCCTFTGAT 

nCGTCGTCT AATGCGCGTC IIIIIIICCT AGAGnCTTC TAGGAAACTA 
6001 CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA CGTTAAGGGA 

GAAAAGAT6C CCCAGACT6C GAGTCACCTT GCTTTTGAGT GCAATTCCCT 
6051 TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT 

AAAACCAGTA CTCTAATAGT TTTTCCTAGA AGTGGATCTA GGAAAATTTA 
6101 TAAAAAT GAA GTnTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC 

AimTACTT CAAAATTTAG 7TAGATTTCA TATATACTCA TTTGAACCAG 
6151 TGACAGTTAC CAATGCHAA TCAGTGAGGC ACCTATCTCA GCGATCTGTC 

ACTGTCAATG GTTACGAATT AGTCACTCCG TGGATAGAGT CGCTAGACAG 
6201 TATTrCGTTC ATCCATAGTT GCCT6ACTCC CCGTC6TGTA GATAACTACG 

ATAAAGCAAG TAGGTATCAA C6GACTGAGG 6GCAGCACAT CTATTGATGC 
6251 ATACGGGAGG GCTTACCATC TGGCCCCAGT GCTGCAATGA TACCGCGAGA 

TATGCCCTCC CGAAT6GTAG ACCGGGGTCA CGACGTTACT ATGGCGCTCT 
6301 CCCACGCTCA CCGGCTCCAG ATTTATCAGC AATAAACCAG CCAGCCGGAA 

GGGTGCGAGT GGCCGAGGTC TAAATAGTCG TTATTTGGTC GGTCGGCCTT 
6351 GGGCCGAGCG CAGAAGTGGT CCTGCAACH TATCCGCCTC CATCCAGTCT 

CCCGGCTCGC GTCHCACCA GGACGTTGAA ATAGGCGGAG GTAGGTCAGA 
6401 ATTAATTGn GCCGGGAAGC TAGAGTAAGT AGTTCGCCAG TTAATAGTTr 

TAATTAACAA CGGCCCTTCG ATCTCATTCA TCAA6CGGTC AATTATCAAA 
6451 GCGCAACGTT GTTGCCATTG CTACAGGCAT CGTGGTGTCA CGCTCGTCGT 

CGCGHGCAA CAACGGTAAC GATGTCCGTA GCACCACAGT GC6AGCAGCA 
6501 TTGGTATGGC nCATTCAGC TCCGGITCCC AACGATCAAG GCGAGTTACA 

AACCATACCG AAGTAAGTCG A6GCCAAG6G nGCTAGTTC CGCTCAATGT 
6551 TGATCCCCCA TGTTGTGCAA AAAAGCGGH AGCTCCTTCG GTCCTCCGAT 

ACTAGGGGGT ACAACACGTT TTTTCGCCAA TCGAGGAAGC CAGGAGGCTA 
6601 CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT ATCACTCATG GHATGGCAG 

GCAACAGTCT TCAHCAACC G6CGTCACAA TAGTGAGTAC CAATACCGTC 
6651 CACTGCATAA nCTCTTACT GTCAT6CCAT CCGTAAGATG CTmCTGTG 

GTGACGTATT AAGAGAATGA CAGTACGGTA GGCATTCTAC GAAAAGACAC 
6701 ACTGGTGAGT ACTCAACCAA GTCATTCTGA GAATAGTGTA TGCGGCGACC 

T6ACCACTCA TGAGTTGGTT CAGTAAGACT CTTATCACAT ACGCCGCTGG 
6751 GAGTTGCTCT TGCCCGGCGT CAATACGGGA TAATACCGCG CCACATAGCA 

CTCAACGAGA ACGGGCCGCA GTTAT6CCCT ATTATGGCGC GGTGTATCGT 
6801 GAACTTTAAA AGTGCTCATC ATTGGAAAAC GTTCnCGGG GCGAAAACTC 

CnGAAATTT TCACGAGTAG TAACCTTTTG CAAGAAGCCC CGCTTTTGAG 
6851 TCAAGGATCT TACCGCTGTT GAGATCCAGT TCGATGTAAC CCACTCGTGC 

AGTTCCTAGA ATGGCGACAA CTCTAGGTCA AGCTACAHG GGTGAGCACG 
6901 ACCCAACTGA TCTTCAGCAT CTTTTACTTT CACCAGCGH TCTGGGTGAG 

IGGGHGACT AGAAGTCGTA GAAAATGAAA 6TGGTCGCAA AGACCCACTC 
6951 C AAAAAC AG6 AAGGCAAAAT GCCGCAAAAA AGGGAATAAG GGCGACACG6 

GTTTTTGTCC nCCGTTTTA CGGCGTTTTT TCCCTTAnC CCGCTGTGCC 
7001 AAATGHGAA TACTCATACT CTTCCTnTT CAATATTATT GAAGCATTTA 

TTTACAACTT ATGAGTATGA GAAGGAAAAA GTTATAATAA CTTCGTAAAT 
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7051 TCAGGGTTAT T6TCTCATGA 6CGGATACAT ATTTGAATGT ATTTAGAAAA 
AGTCCCAATA ACAGAGTACT CGCCTATGTA TAAACTTACA TAAATCTm 

7101 ATAAACAAAT A6GG6TTCCG CGCACATTTC CCCGAAAAGT QCCACCTGAC 
TATTTGTTTA TCCCCAAGGC GCGTOTAAAG QGGCTTTTCA CSGTGGACTG 

7151 GTC 
CAG 
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DMA sequence of APP(1-651)NFEV, K612V-TATexonl(MlL) APP (664-695) 
(SEQ ID NO: 23) 



1 


ATGCTGCCCfi 


GTTTGGPAPT 

dill UuUAx/ 1 


RPTPPTGCTG 


RPPRPPTRRA 




51 


GCTGGAGGTA 


PPPAPTGATG 


GTAATRPTRR 


PPTGPTRRPT 


RAAPPPPARA 


101 


TTRPfATCTT 


PTGTRGPAGA 


PTRAAPJITRP 
V 1 Unnl/M 1 Uw 


APATRAATRT 

MuM 1 UMM 1 U 1 


PPARAATRRR 




AARTRfifiATT 


PARATPPATP 

UMUM 1 vrUM 1 L 


ARRRAPPAAA 
MVauuMvlrfMMM 


APPTRPATTR 


ATA PP A Arjf^ A 


CkJX 


Af5firATrrTC 

MUV3uM 1 v/v 1 V3 


PARTATTRPP 


AAHAARTPTA 
MMuMMu llrf 1 M 


PPPTRAAPTR 
KaKAj \ uMMLr 1 b 


PACATPAPPA 




ATRTCRTARA 


ARPPJIAPPAA 


PPARTRAPPA 


TPPARAAPTR 


RTRPAARPRQ 




fifirrcrAARr 


ARTRPAARAP 

MU 1 UWiMUMV/ 


PPATPPPPAP 


TTTRTRATTP 

1 1 lUlvlMI IKf 


PPTAPPRPTR 




PTTACTTrSf^T 
VyrI lAul lual 


RARTTTATAA 


RTCATRPPPT 


TPTPCTTPPT 


CAPAACTCPA 


Am 


AATTrTTAPA 


PPARRARARR 


ATRRATRTTT 
MluVaHIUl 1 1 


RPRAAAPTPA 


TPTTPAPTRR 




rAPArrfiTPf^ 

UMUMLrUla 1 uu 


PPAAARARAP 
UuMMMVaMuMU 


ATRPARTRAR 


AARARTAPPA 
MMUMU 1 MUL.M 


APTTRPATRA 
ML 1 1 uwM 1 UM 


OXJl 


PTArfif^PATfi 

U 1 MLfVaUUM 1 u 


TTRPTRPPPT 


RPRRAATTRA 
UUUuMM 1 1 VaM 


PAARTTPPRA 


RRRRTARART 

uUVaVS 1 MUMV3 1 


551 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 

winvirvvAvi I vl 


AQAATGTGGA 

rtV#flM 1 Vl 1 vluM 


TTCTGCTGAT 

1 1 \^ 1 VlV# 1 ViM 1 


601 


6CGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 

1 VI 1 W 1 VIVI 1 ViVI 


GGCGGAGCAG 

viviwivmviwnvi 


ACACAGACTA 


651 


TGCAGATG6G 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 

r\vl 1 r\ vlw/Avlr\vl 


GAGGAAGAAG 

unvivirv^virvtu 


701 

/ v J. 


TGRPTGAGGT 


GGAAGAAGAA 


GAAGPPGATG 


ATGAPGARGA 

r\ I \jM\-»viM\j\3M 


PGATGAGGAT 


751 


GGTRATGAGG 

\3vi 1 VIM 1 VJAAdvi 


TARAGGAAGA 


GGfTGAGGAA 

viwiw 1 vlrAvivWA 


PPPTAPRAAR 


AAGPPAPAGA 

MMuVVMUMvIM 


R01 


GARAAPPAPP 


ARPATTRPPA 

MOI./MI lUuLrM 


PPAPPAPPAP 


PAPPAPPAPA 


RARTPTRTRR 

UMU 1 U 1 vil Uvi 


R51 


AAGAGGTGGT 


TPRARTTPPT 


APAAPAGPAG 


PPACTApfpp 


TRATGPPGTT 
1 viM 1 viuVfia 1 1 


Q01 


GAPAAGTATP 


TPRARAPAPP 


TRRRRATRAR 

1 vn3vlvin 1 UnU 


AATRAAPATR 

nn 1 UMM UM 1 U 


PPPATTTPPA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAA6AG 


GCAGAACGTC 


AAGCAAAGAA 


CHGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCAITTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACC6 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTUrrCAA 


1251 


TATGCTAAAG 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 
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1351 ATCCGGTCCC AGGTTATGAC ACACCTCC6T GTGATTTATG AGC6CATGAA 
1401 TCAGTCTCTC TCCCTGCTCT ACAACGTGCC TGCAGTGGCC GAGGAGAHC 
1451 AGGATGAAGT TGATGAGCTG CTTCAGAAAG AGCAAAACTA TTCAGATGAC 
1501 GTCTTGGCCA ACATGATTAG TGAACCAAGG ATCAGTTACG GAAACGATGC 
1551 TCTCATGCCA TCTTT6ACCG AAACGAAAAC CACCGTGGAG CTCCTTCCCG 
1601 TGAATGGAGA GTTCAGCCTG GACGATCTCC AGCCGTG6CA TrCTTTTGGG 
1651 GCTGACTCTG TGCCAGCCAA CACAGAAAAC GAAGHGAGC CTGHGATGC 
1701CCGCCCTGCT GCCGACCGAG GACTGACCAC TCGACCAGGT TCTGQGTTGA 
1751 CAAATATCAA GACQGAGGAG ATCTCTGAAG TGAACTTTGA AGTGGAATTC 
1801 CGACATGAa CAGGATATGA AGTTCATCAT CAAGTATTGG TGTTOTTGC 
1851 AGAAGATGTG GGTTCAAACA AAGGTGCAAT CATTGGACTC ATGGTGGGCG 
1901 6TGTTGTCAT AGCGACAGTG ATCGTCATCA CCTTGGTGAT GCTGAAGAAG 
1951 AAAAA6CTTG GTACCGAGCT CGGATCCACT AGTCCAGTGT GGTGGAAnC 
2001 TGCAGATATC CTGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC 
2051 CAGGAAGTCA GCCTAAAACT GCTTGTACCA AHGCTAnG TAAAAAGTGT 
2101 TGCTTTCATT GCCAAGTTTG TnCATGACA AAAGCCTTAG GCATCTCCTA 
2151 TGGCAGGAAG AAGCGGAGAC AGC6AC6AAG AGCTCATCAG AACAGTCAGA 
2201 CTCATCAAGC TTCTCTATCA AAGCAGAGGA TATCCAGCAC AGTGGCGGCC 
2251 GCAGACGCC6 CTGTCACCCC AGAQGAGCGC CACCTGTCCA AGATGCAGCA 
2301 GAACGGCTAC GAAAATCCAA CCTACAAGTT CTTTGAGCAG ATGCAGAACT 
2351 AG 
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(SEQ ID NO: 24) 

Amino acid sequence of APP(1-651)NFEV, K612V-TATexonI(MlL) APP (664-695) 

ml pg1 a1 1 1 1 aawtaral evptdgnagi 1 aepql amf cgr1 nmhmnvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tnvveanqpvti qnwckrgrkqckthphf vi pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

a vdkyi etpgdenehahf qkakerl eakhrerrasqvmreweeaerqaknl pkadkkavi qhf qekvesi eqe 

aanerqql vethmarvearal ndrrri al enyi tal qavpprprhvf nml kky vraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthi rviyermnqsl si 1ynvpavaeeiqdevde1 Iqkeqnysddvl anmi seprisygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nfeyef rhdsgyevhhqyl vf f aedvgsnkgai 1 gl mvq qvviatv i vi tl vml kkk klgtclgstspvwwns 

6 

a4 f1epvdpr1 epwkhpgsqpktactncvckkccf hcqvcfmtkal gi syqrkkrrqrrrahansathaasl sko 

7 8 
ri sstvQQ a daavtDeerhl skmaanqvenptykf f eamon 
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DNA sequence of APP(1-651)NFEV. K612V-GAL4VP16(delMet) APP (664-695) 

(SEQ ID NO: 25) 



1 


ATGCT6CCCG 


GTTTGGCACr 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


51 


GCTGGAGGTA 


CCCACT6ATG 


GTAATGCTQG 


CCTGCTGGCT 


GAACCCCAGA 


101 


TTGCCAT&TT 


CTGTGGCAGA 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


151 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


ACCTGCATTG 


ATACCAAGGA 


201 


AGGCATCCTG 


CAGTAHGCC 


AAGAAGTCTA 


CCCTGAACTG 


CAGATCACCA 


251 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


301 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGAnC 


CCTACCGCTG 


351 


CTTAGTTGGT 


GAGTTTATAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


401 


AATTCnACA 


CCAGGAGAGG 


ATGGATGTTT 


GCGAAACTCA 


TCnCACTGG 


451 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


AAGAGTACCA 


ACTTGCATGA 


501 


CTACGGCATG 


nGCTGCCCT 


GCGGAAHGA 


CAAGTTCCGA 


GGGGTAGAGT 


551 


nGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


601 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTQG 


GGCGGAGCAG 


ACACAGACTA 


651 


TGCAGATGGG 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


701 


TGGCTGAGGT 


GGAAGAAGAA 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


751 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


CCCTACGAAG 


AAGCCACAGA 


801 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


GAGTCTGTQG 


851 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCHG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAQG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


QCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCA6CCAACG 


A6AGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


MTGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TTGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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1251 


TATGCTAAAG 


AAGTATGTCC GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG CGCATGGTG6 


ATCCCAAGAA 


AGCCGCTCAG 


1351 


ATCCGGTCCC 


AGGTTATGAC ACACCTCCGT 


GTGATTTAT6 


AGCGCATGAA 


1401 


TCACTCraC 


TCCCTGCTCT ACAACGTGCC 


TGCAGTGGCC 


GAQGAGATTC 


1451 


AGGATGAAGT 


TGATGAGCTG CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1501 


GTCnGGCCA 


ACATGAHAG TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


1551 


TCTCATGCCA 


TCTTTGACCG AAACGAAAAC 


CACCGTGGAG 


CTCCnCCCG 


1601 


TGAATGGAGA 


GTTCAGCCTG GACGATCTCC 


AGCCGTGGCA 


TTCTrmGGG 


1651 


GCTGACTCTG 


TGCCAGCCAA CACAGAAAAC 


GAAGTTGAGC 


CTGTTGATGC 


1701 


CCGCCCTGCT 


6CCGACCGAG GACTGACCAC 


TCGACCAGGT 


TCTGGGTTGA 


1751 


CAAATATCAA 


GACGGAGGAG ATCTCTGAAG 


TGAACnTGA 


AGTGGAATTC 


1801 


CGACATGACT 


CAGGATATGA AGTTCATCAT 


CAAGTAnGG 


TGTTCTTTGC 


1851 


AGAAGATGT^ 


GffTTCAAACA AAGGTGCAAT 


CATTCGACTC 


ATGGTGGGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG ATCGTCATCA 


CC7TGGTGAT 


GCTGAAGAAG 


1951 


AAAAAGCTTG 


GTACCGAGCT CQGATCCACT 


AGTCCAGTGT 


GGTGGAATTC 


2001 


TGCAGATATC 


AAGCTACTGT CTTCTATCGA 


ACAAGCATGC 


GATATTTGCC 


2051 


GACTTAAAAA 


GCTCAAGTGC TCCAAAGAAA 


AACCGAAGTG 


CGCCAAGTGT 


2101 


CTGAAGAACA 


ACTGGGAGTG TCGCTACTCT 


CCCAAAACCA 


AAAGGTCTCC 


2151 


GCTGACTAGG 


GCACATCTGA CAGAAGTGGA 


ATCAAGGCTA 


GAAAGACTGG 


2201 


AACAGCTATT 


TCTACT6ATT TTTCCTC6AG 


AAGACCTTGA 


CATGATTTTG 


2251 


AAAATQGATT 


CTTTACAGGA TATAAAAGCA 


TTGnAACAG 


GATTATITGT 


2301 


ACAAGATAAT 


GTGAATAAAG ATGCCGTCAC 


AGATAGAHG 


GCTTCA6TGG 


2351 


AGACTGATAT 


QCCTCTAACA TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATCG6AAG 


AGAGTAGTAA CAAAGGTCAA 


AGACAGTTGA 


CTGTATCGGG 


2451 


AATTCCCGGG 


GATCTGGCCC CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACHAGA 


CGGCGAGGAC GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GA7TTCGATC 


TGGACATGTT GGGGGACGGG 


GAHCCCCGG 


GTCCGGGAH 
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2601 TACCCCCCAC GACTCCfiCCC CCTACGGC6C TCTQGATATG GCCGACTTCG 
2651 AOirTGAGCA GATGTTTACC GATGCCCTTG GAATT6ACGA GTACGGTGGG 
2701 GATATCCAGC ACACTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAQCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATGCAGAAC TAG 
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(SEQ ID NO: 26) 

Amino acid sequence of APP(1-651)NFEV. K612V. GAL4-VP16(delMl) APP (664-695) 

ml pgl aim aavrtaral evptdgnagl 1 aepqi amf cgrl nmhninvqngkwdsdpsgtktci dt kegl 1 qycq 

evypel qi tnvveanqpvtl qnwckrgrkqckt hphf vi pyrcl vgef i sdal 1 vpdkckfl hqertndvcet hi h 

wht vaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttslatttttttesveevvrvpttaastpd 

a vdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkavi qhf qekvesl eqe 

aanerqql vethmarveaml ndrrri al enyi tal qa vpprprhvf rani kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqirsqvmthl rvlyermnqsl si lynvpavaeelqdevdellqkeqnysddvl anmi seprisygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl tt rpgsgl tni kteei sev 
2 3 4 5 

nfevef rhdsgyevhhqyl vf f aedvgsnkgai 1 gl mvq qvvlatv i v1 tl vml kk kkl gtel gstspvwwns 

adi -kl 1 ssi eaacdi crl kkl kcskekpkcakcl knnwecrvsoktkrspl trahl tevesrl erl eql f 11 i f predl d 

6 

mi 1 kmdsl odi kal 1 tol f vqdnvnkdavtdrl asvetdmpl tl rqhri sat ssseessnkgqrql tvsgi pgdl app 

tdvsl qdel hi dgedvamahadal ddf dl dml qdqdspqpqf tohdsapygal dmadf ef eqmf tdal gi dev 

7 8 
qadi qhsqaa ad aaytpeerhl skmaqnqvenptvkf f eqmqn 
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(SEQ ID NO: 27 AND 28) 

1 GAC6GATCGG 6AGATCTCCC 6ATCCCCTAT GGTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGHAG 
51 TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CmGrcTCTT 

ACGAGACTAC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG 

CaCCAGCGA CTCATCACGC GCTCGTTTTA AA7TCGATGT TGTTCCGTrC 
151 GCTTGACCGA CAATTGCATG AA6AATCTGC TTAGG6TTAG GCGTTTTGCG 

CGAACTGGCT GHAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACAn GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTAA CTAATAACTG 
251 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 

ATCAATAAH ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 
301 TGGAGnCCG CGHACATAA CHACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAATGTATT GAATGCCAH TACCGGGCGG ACCGACTGGC 
351 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGHGCTGG GGGCGGGTAA CTGCAGHAT TACTGCATAC AAGGGTATCA 
401 AACGCCAATA GGGACTTTCC AHGACGTCA ATGGGTGGAC TATTTACGGT 

TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGCCA 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCGGG 
501 CCTAT7GAC6 TCAATGACGG TAAATGGCCC QCCTGGCATT ATGCCCAGTA 

GGATAACTGC AGHACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT 
551 CATGACCHA TGGGACTTTC CTACnGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCAIGTAGH ACCCGCACCT 
651 TAGCGGnTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA HGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
701 TOGGAGITTG TTrTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG TrTTAGTTGC CCTGAAAGGT TTTACAGCAT 
751 ACAACTCCGC CCCAHGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGHGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGCnACTG 

CAGATATAH CGTCTCGAGA GACCGAHGA TCTCTTGGGT GACGAATGAC 
851 GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 

CGAATAGCTT TAATTATGCT GAGTGATATC CCTCTGGGTT CGACCGATCG 
901 GTTTAAACTT AAGCnGGTA CCGAGCTCGG ATCCACTAGT CCAGTGTGGT 

CAAATTTGAA TTCGAACCAT GGCTCGAGCC TAGGTGATCA GGTCACACCA 
951 GGAATTCTGC AGATATCCAG CACAGTGGCG GCCGCTCGAG TCTAGAGQGC 

CCTTAAGACG TCTATAGGTC 6TGTCACCGC CGGCGAGCTC AGATCTCCCG 
1001 CCGTTTAAAC CCGCTGATCA GCCTCGACTG TGCCTTCTAG TTGCCAGCCA 

GGCAAATTTG GGCGACTAGT CGGAGCTGAC ACGGAAGATC AACGGTCGGT 
1051 TCTGITGITT GCCCCTCCCC CGTGCCnCC TTGACCCTGG AAGGTGCCAC 

AGACAACAAA CGGGGAGGGG GCACGGAAGG AACTGGGACC TTCCACGGTG 
1101 TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG CATTGTCTGA 

AGGGTGACAG GAAAGGATTA TTTTACTCCT HAACGTAGC GTAACAGACT 
1151 GTAGGTGTCA HCTATTCTG GGGGGTGG6G TGG6GCAGGA CAGCAAGGQG 

CATCCACAGT AAGATAAGAC CCCCCACCCC ACCCCGTCCT GTCGTTCCCC 
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1201 GA6GATTGGG MGACMTAG CAG6CATGCT GGGGATGC66 TGGGCTCTAT 

CTCCTMCCC TTCTGnATC GTCCGTACGA CCCCTACGCC ACCCGAGATA 
1251 GGCTTCTGAG GCGGAAAGAA CCAGCTGG6G CTCTAGGQGG TATCCCCACG 

CCGAAGACTC CGCCTTTCn GGTCGACCCC 6AGATCCCCC ATAGGGGTGC 
1301 CGCCCTGTAG CGGCGCATTA AGCGCGGCQG G7GTGGTGGT TACGCGCAGC 

GCGGGACATC GCCGCGTAAT TCGCGCCGCC CACACCACCA ATGCGCGTCG 
1351 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT 

CACTGGCGAT GFGAACGGTC GCGGGATCGC GGGCGAGGAA AGCGAAAGAA 
1401 CCCTTCCTTT CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC 

GGGAAGGAAA GAGCGGTGCA AGCGGCCGAA AGGGGCAGH CGAGATTTAG 
1451 GGGGCATCCC nTAGGGTTC CGATTTAGre CHTACGGCA CCTCGACCCC 

CCCCGTAGGG AAATCCCAAG GCTAAATCAC GAAATGCCGT GGAGCTGGGG 
1501 AAAAAACTTG AHAGGGTGA IGGHCACGT A6TGGGCCAT CGCCCTGATA 

TmrrGAAC taatcccact accaagtgca tcacccggta gcgggactat 
1551 gacggtmr cgccctttga cgttggagtc cacgncttt aatagtggac 

ctgccaaaaa gcgggaaact gcaacctcag gtgcaagaaa ttatcacctg 
1601 tcttgttcca aactqgaaca acactcaacc ctatctcggt ctatrctttt 

AGAACAAGGT TTGACCnGT TGTGAG7T6G GATAGAGCCA GATAAGAAAA 
1651 GATTTATAAG GGATTTTGGG GATTTCGGCC TAnGGTTAA AAAATGAGCT 

CTAAATAHC CCTAAAACCC CTAAAGCCQG ATAACCAATT TnTACTCGA 
1701 GA7TTAACAA AAATTFAACG CGAATTAATT CTGTGGAATG TGTGTCAGTT 

CTAAATTGTT TTTAAAnGC GCHAAHAA GACACCHAC ACACA6TCAA 
1751 AGGGTGTGGA AAGTCCCCAG GCTCCCCAGG CAGGCAGAAG TATGCAAAGC 

TCCCACACCT TTCAGGGGTC CGAGGGGTCC GTCCGTCTTC ATACGTTTCG 
1801 ATGCATCTCA ATTAGTCAGC AACCAGGTGT GGAAAGTCCC CAGGCTCCCC 

TACGTAGAGT TAATCAGTCG nOGTCCACA CCTTTCAGGG GTCCGAGGGG 
1851 AGCAGGCAGA AGTATGCAAA GCATGCATCT CAAHAGTCA GCAACCATAG 

TCGTCCGTCT TCATACGTTT CGTACGTAGA €TTAATCAGT CGTTGGTATC 
1901 TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC CAGTTCCGCC 

AGGGCGGGGA TTGAGGCGGG TAGGGCGGGG ATPGAGGCGG GTCAAGGCGG 
1951 CAHCTCCGC CCCATGGCTG ACTAATTTTT TTTATrTATG CAGAGGCCGA 

GTAAGAGGCG GGGTACCGAC TGATTAAAAA AAATAAATAC 6TCTCC6GCT 
2001 GGCCGCCTCT GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG 

CCGGCGGAGA CGGAGACTCG ATAAGGTCTT CATCACTCCT CCGAAAAAAC 
2051 GAGGCCTAGG CTTTTGCAAA AAGCTCCCGG GAGCTTGTAT ATCCATTTTC 

CTCCGGATCC GAAAACGTTT TTCGAGG6CC CTC6AACATA TAGGTAAAAG 
2101 GGATCTGATC AGCACGTGTT GACAATTAAT CATCGGCATA GTATATCGGC 

CaAGACTAG TC6TGCACAA CTGTTAA7TA GTAGCC6TAT CATATAGCCG 
2151 ATAGTATAAT ACGACAAGGT GAGGAACTAA ACCATGGCCA AGHGACCAG 

TATCATAHA TGCTGTTCCA CTCCnGATT TGGTACC6GT TCAACTGGTC 
2201 TGCCGTTCCG GTGCTCACCG CGCGCGACCT CGCCGGAGCG GTCGAffTTCT 

ACGGCAAGGC CACGA6TGGC GCGCGCTGCA GCGGCCTCGC CAGCTCAAGA 
2251 GGACCGACCG GCTCGGGHC TCCCGGGACT TCGTGGAGGA CGACHCGCC 

CCTGGCTGGC CGAGCCCAAG AGGGCCCTGA AGCACCTCCT GCT6AAGCGG 
2301 GGTGTGGTCC GGGACGACGT GACCCTGTTC ATCAGCGCGG TCCAGGACCA 

CCACACCAGG CCCTGCTGCA CTGGGACAAG TAGTCGCGCC AGGTCCTGGT 
2351 GGTOGTIGCCG GACAACACCC TGGCCTGGGT GTGGGTGCGC GGCCTQGACG 

CCACCACGGC CTGnGTGGG ACCGGACCCA CACCCACGC6 CCGGACCTGC 
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2401 AGCT6TACGC CGAGTGGTCG GAGGTCGTGT CCACGAACTT CCGQGACGCC 

TCGACATGCG QCTCACCAGC CTCCAGCACA GGTGCTTGAA GGCCCTGCQG 
2451 TCCG6GCCGG CCATGACCGA GATCGGCGAG CAGCCGTGGG GGCGGGAGTT 
■ AGGCCCGGCC GGTACTGGCT CTAQCCGCTC GTCGGCACCC CCGCCCTCAA 
2501 CGCCCTGCGC GACCCGGCCG GCAACTGCGT GCACTTCGnTG GCCGAGGAGC 

GCGGGACGCG CTGGGCCGGC CGTTGACGCA CGTGAAGCAC CGGCTCCTCG 
2551 AGGACTGACA CGTGCTACGA GATTTCGAn CCACCGCCGC CTTCTATGAA 

TCCTGAaGT GCACGATGCT CTAAAGCTAA GGTGGCGGCG GAAGATAOT 
2601 AGGHGGGCT TC6GAATCGT TTTCCGGGAC GCCGGCTGGA TGATCCTCCA 

TCCAACCCGA AGCCTTAGCA AAAQGCCCTG CGGCCGACCT ACTAGGAGGT 
2651 GCGCQGGGAT CTCATGCTOG AGTTCTrCGC CCACCCCAAC TTlEmTATrG 

CGCGCCCCTA GAGTACGACC TCAAGAAGCG GGTGGGGTTG AACAAATAAC 
2701 CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT 

GTCGAATATT ACCAATGTTT ATrrCGlTAT CGTAGTGnT AAAGTGmA 
2751 AAAGCATTTT TTTCACTGCA nCTAGTTGT QGTTTGTCCA AACTCATCAA 

TTTCGTAAAA AAAGTGACGT AAGATCAACA CCAAACAGGT HGAGTAGn 
2801 TGTATCTTAT CATGTCTGTA TACCGTCGAC aCTAGCTAG AGCTTGGCGT 

ACATAGAATA GTACAGACAT ATGGCAGCT6 GAGATCGATC TCGAACCGCA 
2851 AATCATG6TC ATAGCTGTTT CCTGTGTGAA ATTGnATCC GCTCACAAH 

TTA6TACCAG TATCGACAAA GGACACACTT TAACAATAGG CGAGTIGTTAA 
2901 CCACACAACA TACGAGCCGG AAGCATAAA6 TGTAAAGCCT GGGGTGCCTA 

GGTGTGTTGT ATGCTCGGCC nCGTATTTC ACATTTCGGA CCCCACGGAT 
2951 ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 

TACTCACTCG ATTGAGTGTA AHAACGCAA CGCGAGTGAC GGGCGAAAGG 
3001 AGTCGGGAAA CCT6TCGTGC CAGCTGCAH AATGAATCGG CCAACGCGCG 

TCAGCCCTFT GGACAGCACG GTCGACGTAA TTACTTAGCC GGTTGCGCGC 
3051 GGGAGAGGCG GTTTGCGTAT TG66C6CTCT TCCGCHCCT CGCTCACTGA 

CCCTCTCC6C CAAACGCATA ACCCGCGAGA AG6CGAAGGA GCGAGTGACT 
3101 CTCGCTGCGC TCGGTCGTTC GGCTGCQGCG AGCGGTATCA GCTCACTCAA 

GAGCGACGCG AGCCA6CAAG CC6ACGCCGC TCGCCATAGT CGAGTGAGTF 
3151 AGGCGGTAAT ACGGITATCC ACAGAATCAG GGGATAACGC AGGAAAGAAC 

TCCGCCATTA TGCCAATAGG TGTCnAGTC CCCTATTGCG TCCTTTCTTG 
3201 ATGTGAGCAA AAGGCCAGCA AAAGGCCAG6 AACCGTAAAA AGGCCGCGTT 

TACACTCGH TTCCGGTCGT TTTCCGGTCC TTGGCATrTT TCCGGCGCAA 
3251 QCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC 

CGACCGCAAA AAGGTATCCG AGGCGGG6GG ACTGCTCGTA GTGTnTTAG 
3301 GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG 

CTGCGAGTTC AGTCTCCACC GCTTTGQGCT GTCCTGATAT TTCTATGGTC 
3351 GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 

CGCAAAGGGG GACCHCGAG GGAGCACGCG AGAGGACAAG GCTGGGACGG 
3401 GCTTACCQGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT 

CGAATGGCCT ATGGACAGGC GGAAA6AGGG AAGCCCTTCG CACCGCGAAA 
3451 CTCAATGCTC ACGCTGTAGG TATCTCAGH CGGTGTAGGT CGHCGCTCC 

GAGITACGAG TGCGACATCC ATAGAGTCAA GCCACATCCA GCAAGCGAGG 
3501 AAGCT6GGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 

nCGACCCGA CACACGTGCT TGGGGGGCAA 6TCGG6CTGG CGACGCGGAA 
3551 ATCC6GTAAC TATCGTCTTG AGTCCAACCC 6GTAAGACAC GACTTATCGC 

TAGGCCAHG ATAQCAGAAC TCAGGTTGGG CCATTCTGTG CTGAATAGCG 
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3601 CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGC6AG GTATGTAGGC 

GTGACCGTCG TCGGTGACCA TTGTCCTAAT CGTCTCGCTC CATACATCCG 
3651 GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG 

CCACGATGTC TCAAGAACTT CACCACCGGA TTGATGCCGA TGTGATCnC 
3701 GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA 

CTGTCATAAA CCATAGACGC GAGACGACTT CGGTCAATGG AAGCCTmT 
3751 GAGTTGGTAG CTCTT6ATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT 

CTCAAC CATC GAGAACTAGG CCGmGI 1 1 GGTGGCGACC ATCGCCACCA 
3801 TmTTGTTT GCAAGCAGCA GAHACGCGC AGAAAAAAAG GATCTCAAGA 

AAAAAACAAA CGHCGTCGT CTAATGCGCG T CI 1 1 1 1 1 I C CTAGAGTTCT 
3851 AGATCCTTTC ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 

TCTAG6AAAC TAGAAAAGAT GCCCCAGACT GCGAGTCACC TTGCTTTTGA 
3901 CACGTTAAGG GATnTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG 

GTGCAATTCC CTAAAACCAG TACTCTAATA GTTTTTCCTA GAAGTCGATC 
3951 ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA 

TAGGAAAAH TAATTTTTAC TTCAAAATn AGTTAGAITT CATATATACT 
4001 GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT 

CATTTGAACC AGACTGTCAA TGGTTACGAA TTAGTCACTC CGTGGATAGA 
4051 CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG 

GTCGCTA6AC AGATAAAGCA AGTAGGTATC AACGGACTGA GGGGCAGCAC 
4101 TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 

ATCTATTGAT GCTATGCCCT CCCGAATGGT AGACCGGGGT CACGACGTTA 
4151 6ATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 6CAATAAACC 

CTATGGCGCT CT6GGTGCGA GTGGCCGAGG TCTAAATAGT CGTTATTTGG 
4201 AGCCAGCCGG AAGGGCCGA6 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC 

TCGGTCGGCC HCCCGGCTC GCGTCTTCAC CAGGACGTTG AAATAGGCGG 
4251 TCCATCCAGT CTATTAATTG HGCCGGGAA GCTAGAGTAA GTAGTTCGCC 

AGGTAGGTCA GATAATTAAC AACGGCCCTT CGATCTCATT CATCAAGCGG 
4301 AGTTAATAGT TTGCGCAACG nCTTGCCAT TGCTACAGGC ATCGTGGTGT 

TCAATTATCA AACGCGTTGC AACAACGGTA ACGATGTCCG TAGCACCACA 
4351 CACGCTCGTC GTTTGGTATG GCTTCAnCA GCTCCGGHC CCAAC6ATCA 

GTGCGAGCAG CAAACCATAC CGAAGTAAGT CGAGGCCAAG GGTTGCTAGT 
4401 AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 

TCCGCTCAAT GTACTAGGGG GTACAACACG IIIIIICGCC AATCGAGGAA 
4451 CGGTCCTCCG ATCGTTGTCA GAAGTAAGH GGCCGCAGTG TTATCACTCA 

GCCAGGAGGC TAGCAACAGT CTTCAnCAA CCGGCGTCAC AATAGTGAGT 
4501 TGGTTATGGC AGCACTGCAT AAnCTCTTA CTGTCATGCC ATCCGTAAGA 

ACCAATACCG TCGTGACGTA HAAGAGAAT GACAGTACGG TAGGCAHCT 
4551 TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATA6TG 

ACGAAAA6AC ACTGACCACT CATGAGTTGG HCAGTAAGA CTCHATCAC . 
4601 TATGCGGCGA CCGAGTTGCT CTTGCCCGGC 6TCAATACGG GATAATACCG 

ATACGCC6CT GGCTCAACGA GAACGGGCCG CAGTTATGCC CTATTATGGC 
4651 CGCCACATAG CAGAACTHA AAAGTGCTCA TCAnGGAAA ACGTTCTTCG 

GCGGTGTATC GTCTTGAAAT TTTCACGAGT AGTAACCTTT TGCAAGAAGC 
4701 GGGCGAAAAC TCTCAAGGAT CHACCGCTG TTGAGATCCA GTTCGATGTA 

CCCGCTTTTG AGAGTTCCTA GAAT6GCGAC AACTCTAGGT CAAGCTACAT 
4751 ACCCACTCGT GCACCCAACT GATCTTCAGC ATCmTACT TTCACCAGCG 

TGGGTGAGCA CGnrGGGHGA CTAGAAGTCG TAGAAAATGA AAGTGGTCGC 
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4801 TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA 

AAAGACCCAC TCGTTTTTGT CCnCCGlTT TACGGCGTTT TTTCCCTTAT 
4851 AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCnT HCAATATTA 

TCCCGCTBTG CCTTTACAAC HATGAGTAT GAGAAGGAAA AAGTTATAAT 
4901 TTGAAGCATT TATCAGGGH ATTGTCTCAT GAGC6GATAC ATATTTGAAT 

AACTTCGTAA ATAGTCCCAA TAACAGAGTA CTCGCCTATG TATAAACTTA 
4951 GTATTTAGAA AAATAAACAA ATAGQGGTTC CGCGCACATT TCCCCGAAAA 

CATAAATCTT TTrATTTGrr TATCCCCAA6 GCGCGTGTAA AGGGGCTTTT 
5001 GTGCCACCTG ACGTC 

CACGGIGGAC TGCAG 
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DNA sequence of APP(1-651)NFEV, TATexonl(MlL) APP (664-695) 
(SEQ ID NO: 29) 



1 


ATGCTGfrrfi 


fiTTTRRPAPT 


RPTPPTRPTR 
UulUUIuu lU 


RPPRPPTCRA 
UUVfUUUIUUA 


PCRPTPRCCr 

t^uLluuulK/ 


51 




PPPAPTRATR 

VrUUMVr i UM 1 U 


RTAATRPTCR 
UlMMIUuluu 


PPTRPTRRPT 
1 UU 1 UUU 1 


RA APPPP ACA 

uAAuUl/uAoA 


ini 


TTCrrATfiTT 
1 luUv»nluil 1 


PTRTRRPARA 
V 1 u 1 uuw\uM 


PTRAAPATCP 
L 1 UAAlfM 1 UU 


APATPAATPT 
AuAluAAIul 


rr AP A ATMP 

Lw\uAAIuuu 


1R1 


MM\3 1 \3\3V3rM 1 


PARATPPATP 


ARCRAPPAAA 
AuuuAULAAA 


APPTPPATTP 

ALLIuLAI lu 


ATATPA APPA 


901 


AfiRPATPrTf; 
MUuuM 1 l^U 1 u 


PARTATTRPP 
UMUiMI lUUl/ 


AARAACTPTA 
MMuAAu 1 U 1 A 


PPPTCAAPTP 

Lulr 1 uAAU 1 u 


PAPATPAPPA 

uAuAIUAuuA 


PR! 


M 1 U 1 UU 1 MUM 


ARPPJVAPPAA 
MUULMMV/V/MM 


PPARTRAPPA 
LUMU 1 uMLw\ 


TPPARAAPTR 
1 ulrfMuMAU 1 U 


RTRPAARPRR 
U 1 ul/AAuuuu 




ftfirr/sPAAfir 

UuuvUUMMUU 


ARTRPAARAP 
MU 1 UuMMUnl/ 


PPATPPPPAP 


TTTRTOATTP 
i 1 lUluAl lu 


PPTAPPfSPTP 

UUlAUuuLIu 


OiJX 


rTTARTTGPiT 

V/ 1 1 M\3 1 1 CK3 1 


RARTTTATAA 

UMU 1 1 1 M 1 MM 


RTRATRPPPT 
U 1 UM 1 UUtrU 1 


TPTPRfTTPPT 
lUIUUI lUUI 


RAPAARTRPA 
uAUAAu 1 Utr A 




AATTrTTAfA 


PPARRARARR 

UUMUUMUMUU 


ATRRATRTTT 

MICnSMIUi 1 1 


RPRAAAPTPA 
UUUMMMU I UM 


TPTTPAPTRR 
1 U 1 1 UMU 1 UU 






PPAAAGARAP 


ATRPARTRAR 

M 1 UUMU 1 UMU 


AARARTAPPA 

MMUMU 1 MUUM 


APTTRPATRA 
MU 1 1 uUM 1 UM 


501 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 

UMMU 1 1 UUUM 


GGGGTAGAGT 

UUUU 1 MUMU 1 


551 


TTGTGTGTTG 

1 fVIIVIIVII IWI 


CCCACTGGCT 


RAAGAAARTG 


APAATRTRRA 

MUMM 1 U 1 UUM 


TTPTRPTGAT 

1 1 U 1 UU 1 UM 1 


601 


GCGGAGGAGG 


ATRACTCGGA 

r\ 1 UMU 1 wUUM 


TRTPTRRTRR 

lUIUIUUIUU 


RRPRRARPAR 

UUUUUMUUMU 


APAPARAPTA 

MUMUMUMU 1 M 


651 


TGTAGATCRG 


ARTRAARAPA 

MU 1 UMMUMV/M 


AARTARTARA 

MMU 1 MU 1 MUM 


ARTARPARAR 

MU 1 MUUMUMU 


RARRAARAAR 

UMUUMMUMMU 


701 


I uV3v» 1 UMV3V3 1 


RRAARAARAA 

UUMMUMMuMM 


RAARPPRATR 

UMMUuUUM 1 U 


ATRAPRARRA 
M 1 UMUuMuUM 


PRATRARRAT 
UUA 1 uMuUA 1 


751 


UU 1 UM 1 UMUU 


TARARRAARA 
1 MUMUUnMUM 


RRPTRARRAA 
U\A/1UMUUMM 


PPPTAPRAAR 
LUlflAuuAAu 


AARPPAPARA 
AAuuLAuAuA 


ftni 

OUl 


RARAAPPArr 
UMUMMUVrMLrU 


ARPATTRPPA 
MUIfMi 


PPAPPAPPAP 
uUAV/UAUUALr 


PAPPAPPAPA 
uAuUALuAuA 


RARTPTRTRC 
uAu 1 U 1 U 1 UU 


ODX 


AARARRTRRT 
MMuMuu 1 Vau 1 


TPRARTTPPT 
lUUMUl ILui 


APAAPARPAR 
AuAAUAuUAu 


PPARTAPPPP 
Ul/Au 1 AUULu 


TCATCPPCTT 
luAluLuul 1 




CApAARTATP 


TPRARAPAPP 

1 IrUMUMl/ALlr 


TRRRRATRAR 
luuuuAluAu 


A AT/2 A AP ATP 
AA 1 uAAV/A 1 u 


PPPATTTPPA 

uULAI 1 ItuA 


951 


GAAAGCCAAA 


GAGAGGCHG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCA6G 


1001 


TCATGAGAGA 


ATGGGAAGAG 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGHAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATG6 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTG6C 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCT6CAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


1251 


tatgctaaAg 


AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


1301 


TAAAGCATTT 


CGAGCATGTG 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 
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1351 


ATCCGGTCCC 

f\ 1 wWUU 1 w\#0 


AGGTTATGAC ACAPCTrf GT GTfiATTTATG 

nvlvl 1 1 M 1 unu rWjTlww I o\«vl 1 vl 1 ur\ 1 1 I n I vl 


AGTGrATGAA 


1401 


TCAGTCTCTC 


TCCCTGCTCT ACAACGTGCC TGCAGTGGCC 

1 vW 1 VlVr 1 w 1 r^V#nnwVl l VlwV l MWI^VI I MVlMVr 


GAGGAGATTf 

VlflVlvVAW% 1 1 Vr 


1451 


AGGATGAAGT 


TGATGAGCTG CTTCAGAAAG AGCAAAACTA 


TTCAGATGAC 

1 1 wM vm 1 w\ w 


1501 


GTCTTGGCCA 


ACATGATTAG TGAACCAAGG ATCAGTTACG 

nwri 1 vin 1 1 #ivi 1 viTwiv/ v/nn VI vi r\ 1 v^rrvi i i nvvi 


GAAACGATGC 

virwTVvio 1 VI V 


1551 


TCTCATGCCA 


TCTTTGACCG AAAC6AAAAC CACCGTGGAG 

1 w 1 1 1 vir^vV/Vi nnnwvirwTnvy wnv vvi i vivir^vi 


CTCCTTCCCG 

W 1 WW 1 1 V/wwvl 


1601 


TGAATGGAGA 


GTTCAGCCTG GACGATCTCC AGCCGTGGCA 

VI 1 1 V#»ViW 1 VI V»»»WW» 1 >/ 1 WW r%VIWWVI 1 VIV(V#i 


TTCTTTTGGG 

i 1 W 1 1 i f VIViVI 


1651 


GCTGACTCTG 


TGCCAQCCAA CACAGAAAAC GAAG7TGAGC 

I viw%/v iviwwr V » wff »WF %vif www l W VW W IVI 1 1 W IVi W 


CTGTTGATGC 


1701 


CCGCCCTGCT 


GCCGACCGAG GACTGACCAC TCGACCAGGT 


TCTGQGirreA 


1751 


CAAATATCAA 


GACGGAGGAG ATCTCTGAAG TGAACTTTGA 


AffraGAATTC 


1801 


CGACATGACT 


CAG6ATAT6A AGTTCATCAT CAAAAATTGG 


TGITCTTTGC 


1851 


A6AAGATGTG 


GGTTCAAACA AAGGTGCAAT CATTGGACTC 

VIVI 1 1 w# W W»W# » IV «VIVI 1 Vlw* W 1 1 w» » 1 1 VIVW »W 1 w 


ATGGT6GGCG 


1901 


GTGTTGTCAT 


AGCGACAGTG ATC6TCATCA CCTTGGTGAT 

r«viw%4fiw«ivi 1 VI n 1 wvt 1 wfi • wf » WW i i vivi i vw* i 


GCTGAAGAA6 

WW 1 VM W W tVI 


1951 


AAAAAGCTTG 

J If W w w iVIW 1 1 Vi 


GTACCGAGCT CGGATCCACT AGTCCAGTGT 

Vi 1 #\w wVlrWIw 1 N/VIVir* 1 wwf»w 1 fftW 1 wwfiVI 1 VI 1 


GGTGGAATTC 

WW i ViVifW* i I 


2001 


TGCAGATATC 


CTGGAGCCAG TAGATCCTA6 ACTAGAGCCC 

\^ 1 viviftviwrivi 1 f»w* 1 WW 1 r^vi f»w ■ f»vw»viwww 


TGGAAGCATC 


2051 


CAGGAAGTCA 

wr«vivw w «vi 1 w/» 


GCCTAAAACT GCTTGTACCA ATTGCTATTG 

viww 1 nnnnw i viw i i vi i nwwri i i viw i n i i vi 


TAAAAAGTGT 

1 « w w w w mw 1 Vi 1 


2101 

<mXW X 


TGCTTTCATT 

1 Vlw III V/< 1 1 


GCCAAGTTTG TTTCATGACA AAAGCCTTAG 

Vlww# w»vi 1 » 1 VI III w#% 1 vi/iwri #TrT#%viww i i f \\a 


GCATCTCCTA 

WWf » 1 W 1 WW 1 »» 


2151 


TGGCAGGAAG 


AAGCGGAGAC AGCGACGAAG AGCTCATCAG 


A A A ^nr/^ A O A 

AACAGTCAGA 


2201 


CTCATCAAGC 


TTCTCTATCA AAQCAGAGGA TATCCAGCAC 


AGTGGCGGCC 


2251 


GCAGACGCCG 


CTGTCACCCC AGAGGAGCGC CACCTGTCCA 


AGATGCAGCA 


2301 


GAACGGCTAC 


GAAAATCCAA CCTACAAGTT CTTTGAGCAG 


ATGCAGAACT 


2351 


AG 
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(SEQ ID NO: 30) 

Amino acid sequence of APP(1-651)NFEV. K612V-TATexonI(MlL) APP (664-695) 

ml pgl aim aavrtaral evptdgnagl 1 aepqi amf cgrl nmhnmvqngkwdsdpsgtktci dtkegi 1 qycq 

evypel qi tn vveanqpvti qnwckrgrkqckt hphf vi py rcl vgef i sdal 1 vpdkckfl hqermdvcethl h 

whtvaketcsekstnl hdygral 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkvvevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyl etpgdenehahf qkakerl eakhrermsqvmreweeaerqaknl pkadkkaviqhfqekvesl eqe 

aanerqql vethmarveaml ndrrrl al enyi tal qa vpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvmthl rviyerrnnqsl si lynvpavaeeiqdevdel Iqkeqnysddvl anrai sepri sygndal 

npsl tetkttvel 1 pvngef si ddl qpwhsf gads vpantenevepvdarpaadrgl ttrpgsgl tni kteei sev 
2 3 4 5 

nfevef rhdsgyevhhqkl vf f aedvgsnkgai iglmvggyyiatyi vi tl vml kk kklgtelgstspvwwns 

6 

ad^ -l epvdprl epwkhpgsqpktactncvckkccf hcQ vcf nitkal gi sygrkkrrQrrrahqnsqthaasl skq 

7 8 
Pi sstvQQa daavtpeerhl skmqqngvenptykf f egmqn 
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DMA sequence of APP(1-651)MFEV. GAL4VP16(delMet) APP (664-695) 



fccn 


in Kin . oi \ 
ID NO: 31) 










1 


ATQCTGCCC6 


PTTTPPP APT 

6TTT66CACT 


GCTCCTGCT6 


GCCGCCTGGA 


CGGCTC6GGC 


51 


GCTuGAGGTA 


PPP APTPATP 

CCCACT6ATG 


PTA ATPPTPP 

6TAATGCTGG 


CCTGCTGGCT 


PA APPPP AP A 

GAACCCCAGA 


101 


TTGCCAT6TT 


PTr»Tpr*p A p A 

CTGTGGCAGA 


PTP A A P ATPP 

CTGAACATGC 


AP ATPA ATPT 

ACATGAATGT 


PP A/^A ATOOO 

CCAGAATGGG 


1 CI 

Ibl 


A A/^T/^/^/^ ATT 

AA6TG6GATT 


PAPATPPATP 

CA6ATCCATC 


APPPAPPA A A 

AGGGACCAAA 


APPTPPATT/^ 

ACCT6CATT6 


ATA PPA APP A 

ATACCAAQGA 


201 


AGGCATCCT6 


P A /*T ATTPPP 

CA6TATTGCC 


A APA APTPTA 

AA6AAGTCTA 


PPPTP A APT/^ 

CCCT6AACT6 


PAP ATP APP A 

CAGATCACCA 


251 


A Tr*Tor*T Ar* A 

ATGTGGlAGA 


APPPA APP A A 

AGCCAACCAA 


PP APTP APP A 

CCAGTGACCA 


Tf*r* A P A A PT^ 

TCCAGAACTG 


PTPP A Af*f^f*r* 

GTGCAAGCGG 


301 


GGCCGCAAGC 


A PTPP A A P A P 

AGTGCAAGAC 


PP ATPPPP A 

CCATCCCCAC 


111 PTP A TTP 

TTTGTGATTC 


PPTAPPPPTP 

CCTACCGCTG 


351 


CTTAGTTGGT 


A n"T' A *T* A A 

GAGTTTATAA 


GT6ATGCCCT 


TOTO OTTO OT 

TCTCGTTCCT 


A O A A OTOO A 

GACAAGTGCA 


it A -n 

401 


A ATTr*'l"r A r» A 

AATTCTTACA 


CCAGGA6AGG 


A TOO A TOTTTTT 

AT6GATGTTT 


OOOA A A OTO A 

GCGAAACTCA 


TOTTO A OTOO 

TCTTCACTGG 


451 


CACACCGTCG 


/^/^A A A/^A/>A^ 

CCAAAGAGAC 


A TOO A OTO A O 

ATGCAGTGAG 


A A Ar^TAOO A 

AAGAGTACCA 


A OT T PP A TO A 

ACTTGCATGA 


501 


CTACGGCATG 


TT6CTGCCCT 


GCGGAATTGA 


P A A PTTPPP A 

CAAGTTCCGA 


PPPPT APA PT 

GGGGTAGAGT 


551 


TTGTGTGTTG 


f%fsr* A /^TPPPT 

CCCACTGGCT 


P A A P A A A fVrf^ 

GAAGAAAGTG 


APA ATPTPPA 

ACAATGTGGA 


TTPTPPTP AT 

TTCTGCTGAT 


601 


GCGGAGGAGG 


A TP A I^TPPP A 

ATGACTCGGA 


TPTPTPPTPP 

TGTCTGGTGG 


OOPPP A PP A P 

GGCGGAGCAG 


AP AP APAPTA 

ACACAGACTA 


651 


TGCAGATGGG 


APTP A AC* A/* A 

AGTGAAGACA 


A APTAPTAPA 

AAGTA6TAGA 


APTAPPAPAP 

A6TA6CA6A6 


PAr*/^A APA AP 

GAGGAAGAAG 


7A1 

701 


xpp r*Tr» A p PT 
TGGCTGAGGT 


PPA APA APA A 

GQAAGAAGAA 


PA APPPPATP 

GAAGCLGATG 


ATP A PPAPP A 

A 1 GALliAUGA 


PP ATP APP AT 

uGA 1 uAuGA 1 


"TCI 

7ol 


PPTPATPAPP 

GGTbATQAGG 


TAPAPPA APA 

TAGA6QAA6A 


PPPTP A PP A A 

GGClGAbGAA 


PPPTAPPA AP 

LLL 1 ACGAAb 


A A/^P APAPA 

AAGLLALAGA 


OA1 

801 


PAPA APP APP 

GAbAAtLALl/ 


APP ATTPPPA 


PPAPP APP AP 

ttAOLAl/LAu 


rAPPAPPAPA 

LALtALLALA 


GAb 1 1 1 b i bb 


851 


A A P A / V*TPPT 

AAGAuGTGGT 


TPP A ^^TTr/*T 

TCGA6TTCCT 


ACAACAGuAb 


LOAulALLLL 


TPAT/^rrPTT 


901 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


951 


GAAAGCCAAA 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


1001 


TCATGAGAGA 


ATGGGAAGAG 


QCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


1051 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


CAGGAGAAAG 


TGGAATCTTT 


1101 


GGAACA6GAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


ACACACATGG 


1151 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1201 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 
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AA/2TATCTPP CPCPACAAPA 

AAblAlblV/L btbtAbAALA 


CAA(2CAPAf2A 

bAAbbAtAbA 


pAriPAPAppp 

tAbtAtAttt 




TAAA^ATTT 


ppAPP ATCTn crchTrrTrr 
LbAbUAIblb UbUAIbbIbb 


ATPPPAA/2AA 
A 1 ttt AAbAA 


ACPPriPTPAP 

Abttbt 1 t Ab 


XODl 




APPTTAT^AP APAPPTPPPT 

Abb) lAibAU AtAtLlLLbi 


PTP AMI AT(2 

btbAl 1 lAlb 


ACP^^PATPA A 

Abtbt A 1 bAA 


1 /ni 
14Ui 




XP P PX/V^TPX A P A A P PXPP P 

1 LLt 1 bt 1 t 1 ALAALb 1 bCL 


IbtAblbbtt 


PA/^APAXXP 

bAbuAbAI It 


1451 


AuGATGAAGT 


XPAXPAPPXP PXXPAPAAAP 

TGATGAGCTb CTTCAGAAAG 


APO A A A APXA 

AGCAAAACTA 


XXP APAXOAO 

TTtAbATGAC 


1501 


GTCTTGGCCA 


APAXOATXAP XPAAPPAAPP 

ACATGATTAb TbAACCAAbb 


AXP AOXXAPP 

ATCAGTTACG 


PA A APPAXPO 

GAAACGATGC 


1551 


TCTCATBCCA 


X/*XXXr»A/*PO AAAOOAAAAP 

TCTTTGACCb AAACGAAAAC 


A oooxor* AP 

CACCGTGGAG 


PXOPXXPPOO 

tTtCTTtttG 


1601 


TO A ATO^ AO A 

TGAATGGAGA 


OXXO A OOPXO O A PO A X/*X/*0 

GTTCAGCCTG GACGATCTCC 


AGCCGTGGCA 


xxo'i' r i'Tor*r* 

TTCTTTTGQG 


1651 


OPTY* A /*TOXO 

GCTGACTCTG 


XOOr* AO/*/* A A /*AP AOA A A A/* 

TGCCAGCCAA CACAGAAAAC 


OA AOXXOAOn 

GAAGTTGAGt 


oxoxxr* A xr*r* 

tTGTTGATGt 


1/01 


LLliLLtTSCT 


f^rr^ArrnAf^ papxpappap 
bLLbALLbAb bALlbALLAt 


Tcn Arr AnoT 
1 tGAttAbbl 


1 t 1 GbbTTbA 


1 TCI 

1/51 


PA A AX ATP A A 

CAAATATCAA 


PAPr>r*APPAP AXPXPXPA AP 

bACbbAbbAG ATCTCTbAAb 


XPA APXXXPA 

TbAAtTTTbA 


A PXPP A A XXP 

Ab 1 bbAATTt 


loOl 


PPAPAXPAPX 

CBACATuACT 


PAPPATAXPA APXTPAXPAX 

CAbbATATuA AbTTCATCAT 


P A A A A AXXPP 

tAAAAATTbb 


XPXTP 1 1 1 

TbTi tl IJbt 


lo51 


A P A A P ATPTP 


CPTTPAAAPA AA/2PTPPAAT 

bbl ILAAALA AAbblbLAAl 


P ATT/2P A PTP 

tAI IbbAtIt 


AT/2PT^2/2CPP 

Albblbbbtb 


iQni 

l7Ui 


PTPTTPTP AT 


APPPAPAPTP ATPPXPAXPA 

AbLbAtAbIb AlUbltAIV/A 


PPTT^iPTPAT 

tt 1 1 bb 1 bA 1 


/2PTPAACAA(2 

bt luAAbAAb 




AAAAArSPTTC 
AAAAAuLI lb 


CTAPPCA(2PT PCHATPPAPT 
b 1 AULbAbL 1 UbbA 1 UUAU 1 


ACTPPACTCT 
Ab 1 tt Ab 1 b 1 


ftCT/SHAATTP 
bblubAAl It 


onni 


TfSPACATATP 
IbuAbAIAlU 


AACPTAPTCT PTTPTATP/2A 
AAbu 1 Al^ 1 b 1 L 1 1 U 1 Al LbA 


APAA^iPATCP 
At AAbt A 1 bt 


HATATTT^^PP 
bAIAl 1 Ibtt 




^2APTTAAAAA 
uALl lAAAAA 


CPTPAAPTCP TPPAAAPAAA 
bt 1 l/AAb 1 bL 1 LLAAAbAAA 


AAPPPAACTr^ 
AAttbAAb 1 b 


Pr^PPAACTCT 

tbtt AAb 1 b 1 


civi 


PTCAACAAPA 

L 1 bAAbAAU A 


APTC^^CACTC TPCPTAPTPT 

At 1 bbbAb 1 b 1 Lbt 1 AL 1 L I 


PPPAAAAPPA 
tttAAAAttA 


AAA^^CTPTPP 
AAAbb 1 1 1 tt 


01 CI 


/IIPTPAPTAPP 

bL i uAL 1 Abb 


r^APATPTPA PAPAAPTPPA 

btALAILIbA LAbAAblbbA 


ATPAAPOPTA 

A 1 1 AAbbt 1 A 


CAAA^APTi^P 
bAAAbAt 1 bb 




A APAPPXAXX 

AACAbUlAI 1 


XPTAPXPAXT 1 1 IPPXPPAP 

1 L 1 At 1 bA M t M tt i tbAb 


AAPAPPTTPA 

AAbAtt 1 1 bA 


P ATP A 1 1 1 IP 

tAlbAI 1 1 lb 


2Z51 


AAA AXPr* AXX 

AAAATbbATT 


PXTXAPAPPA XAXAAAAPPA 

tl 1 lAtAbbA TATAAAAGCA 


XXPXXA AP AP 

TTblTAAtAb 


P ATT ATTTPT 

bAI lAI 1 tbi 


2301 


ACAAGATAAT 


GTGAATAAAG ATGCCGTCAC 


AGATAGATTG 


GCnCAGTGG 


2351 


AGACTGATAT 


GCCTCTAACA TTGAGACAGC 


ATAGAATAAG 


TGCGACATCA 


2401 


TCATC6GAAG 


AGAGTAGTAA CAAAGGTCAA 


AGACA6TTGA 


CTGTATCGGG 


2451 


AAHCCCGGG 


GATCTGGCCC CCCCGACCGA 


TGTCAGCCTG 


GGGGACGAGC 


2501 


TCCACHAGA 


CGGCGAGGAC GTGGCGATGG 


CGCATGCCGA 


CGCGCTAGAC 


2551 


GATTTCGATC 


TGGACATGTT GGGG6ACGGG 


GATTCCCCGG 


GTCCGGGAIT 
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2601 TACCCCCCAC GACTCCGCCC CCTACGQCGC TCTGGATATG GCCGACTTCG 
2651 AGITTGAGCA GATGnTACC GATGCCCTTG GAAHGACGA GTACGGTGGG 
2701 GATATCCAGC ACAGTGGCGG CCGCGACGCC GCTGTCACCC CAGAGGAGCG 
2751 CCACCTGTCC AAGATGCAGC AGAACGGCTA CGAAAATCCA ACCTACAAGT 
2801 TCTTTGAGCA GATQCAGAAC TAG 

FIG.37C 



wo 03/072041 



PCTAJS03/05458 



76/76 



(SEQ ID NO: ^2) 

Amino acid sequence of APP(1-651)NFEV, GAL4-VP16(delMl) APP (664-695) 

ml pgl al m aawtaral evptdgnagi 1 aepqi amf cgrl nmhmnvqngkwdsdpsgtktci dtkegl 1 <|ycq 

evypel q1 tnvveanqpvtl qnwckrgrkqckthphf v1 pyrcl vgef i sdal 1 vpdkckf 1 hqermdvcethl h 

whtvaketcsekstnl hdygml 1 pcgi dkf rgvef vccpl aeesdnvdsadaeeddsdvwwggadtdyadgs 

1 

edkwevaeeeevaeveeeeadddeddedgdeveeeaeepyeeaterttsiatttttttesveevvrvpttaastpd 

avdkyi etpgdenehahf qkaker 1 eakhrermsqvmreweeaerqaknl pkadkka v1 qhf qekvesi eqe 

aanerqql vethmarveaml ndrrri al enyl tal qavpprprhvf nml kkyvraeqkdrqhtl khf ehvrmvd 

pkkaaqi rsqvnrthl r v1 yermnqsl si 1 yn vpa vaeel qdevdel 1 qkeqnysddvl aruni sepri sygndal 

mpsl tetkttvel 1 pvngef si ddl qpwhsf gadsvpantenevepvdarpaadrgl ttrpgsgl tn1 kteei sev 
2 3 4 5 

nfev efrhdsqyevhhqkl vf f aedvgsnkgai 1 gl mvq gvviatv i vi tl vral kk kkl gt e l gstspvwwns 

ad ticl 1 ssi eqacdi crl kkl kcskekokcakcl knnwecrvspkt krsol trahl tevesrl erl eql fl 1 i f oredl d 

6 

mi 1 kmdsl qdi kal 1 tol fvqdnvnkdavtdrl asvetctopl tl rqhri satssseessnkqqrqltvsqiDqdl app 

tdvsl qdel hi doedvamahadal ddf dl dml gdqdspgpqftDhdsapvqal dmadfef eomf tdal gi de v 

7 8 
qqdi qhsgaaa daaytpeerhl skmqqng venpt vkf f eqman 

FIG. 38 
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<4D0> 1 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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tctgggttga caaatatcaa gacggaggag atctctgaag tgaatctaga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caagtattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 
tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 
tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 
aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 
aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 
cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 
atgcagaact ag 2352 

<210> 2 
<211> 783 
<212> PRT 

<213> fusion protein - hiiman 



<400> 2 






























Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala 


Ala 


Trp 


Thr 


Ala 


Arg 


1 






5 










10 










15 




Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 


Ala 


Gly 


Leu 


Leu 


Ala 


Glu 


Pro 








20 










25 










30 






Gin 


He 


Ala 


Met 


Phe 


Cvs 


Gly 


Arg 


Leu 


Asn 


Met 


His 


Met 


Asn 


Val 


Gin 






35 










40 










45 








Asn 


Gly 


Lvs 


Tro 


Asp 


Ser 


Asp 


Pro 


Ser 


Gly 


Thr 


Lvs 


Thr 


Cys 


He 


Asp 




50 










55 










60 










Thr 


LVS 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


Cvs 


Gin 


Glu 


Val 


Tyr 


Pro 


Glu 


Leu 


65 








70 










75 










80 


Gin 


He 


Thr 


Asn 


Val 


Val 


Glu 


Ala 


Asn 


Gin 


Pro 


Val 


Thr 


He 


Gin 


Asn 










85 










90 










95 






Cvs 


Lvs 


Arcr 


Gly 


Ara 


Lvs 


Gin 


Cys 


Lys 


Thr 


His 


Pro 


His 


Phe 


Val 








100 










105 










110 






He 


Pro 


Tvr 


Arcr 


Cvs 


Leu 


Val 


Gly 


Glu 


Phe 


He 


Ser 


Asp 


Ala 


Leu 


Leu 






115 






120 










125 








Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 


His 


Gin 


Glu 


Arg 


Met 


Asp 


Val 


Cys 




130 










135 










140 










Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


val 


Ala 


Lys 


Glu 


Thr 


Cys 


Ser 


Glu 


145 










150 










155 










160 


Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 


Gly 


Met 


Leu 


Leu 


Pro 


Cys 


Gly 


He 








165 










170 










175 




Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 


Val 


Cys 


Cys 


Pro 


Leu 


Ala 


Glu 


Glu 




180 










185 










190 






Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 


Ala 


Glu 


Glu 


Asp 


Asp 


Ser 


Asp 


Val 




195 










200 










205 








Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 


Tyr 


Ala 


Asp 


Gly 


Ser 


Glu 


Asp 


Lys 


210 








215 










220 










Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


Val 


Ala 


Glu 


Val 


Glu 


Glu 


Glu 


225 










230 










235 










240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Gly 


Asp 


Glu 


Val 


Glu 


Glu 










245 










250 










255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


He 








260 










265 










270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 






275 










280 










285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 










300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


Ala 


His 


Phe 


Gin 


Lys 


Ala 


Lys 


305 








310 










315 










320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 










325 










330 










335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 


Lys 


Asn 


Leu 


Pro 


Lys 


Ala 


Asp 






340 










345 










350 
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Lys 


Lys 


Ala 


Val He Gin His 


Phe Gin Glu Lys Val 


Glu 


Ser 


Leu 


Glu 


355 




360 


365 








Gin 


Glu 


Ala 


Ala Asn Glu Arg 


Gin Gin Leu Val Glu 


Thr 


His 


Met 


Ala 




370 




375 


380 










Arg 


Val 


Glu 


Ala Met Leu Asn 


Asp Arg Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


385 






390 


395 








400 


Tyr 


He 


Thr 


Ala Leu Gin Ala 


Val Pro Pro Arg Pro 


Arg 


His 


Val 


Phe 






405 


410 






415 




Asn 


Met 


Leu 


Lys Lys Tyr Val 


Arg Ala Glu Gin Lys 


Asp 


Arg 


Gin 


His 








420 


425 




430 






Thr 


Leu 


Lys 


His Phe Glu His 


Val Arg Met Val Asp 


Pro 


Lys 


Lys 


Ala 






435 




440 


445 








Ala 


Gin 


lie 


Arg Ser Gin Val 


Met Thr His Leu Arg 


Val 


He 


oyr 


Glu 




450 




455 


460 










Arg 




Asn 


Gin Ser Leu Ser 


Leu Leu Tyx Asn Val 


Pro 


Ala 


Val 


Ala 


465 






470 


475 








480 


Glu 


Glu 


lie 


Gin Asp Glu Val 


Asp Glu Leu Leu Gin 


Lys 


Glu 


Gin 


Asn 








485 


490 






495 




Tyr 


Ser 


Asp 


Asp Val Leu Ala 


Asn Met He Ser Glu 


Pro 


Arg 


He 


Ser 




500 


505 




510 






Tyr 


Gly 


Asn 


Asp Ala Leu Met 


Pro Ser Leu Thr Glu 


Thr 


Lys 


Thr 


Thr 


515 


520 


525 








Val 


Glu 


Leu 


Leu Pro Val Asn 


Gly Glu Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 




530 




535 


540 










Pro 


Trp 


His 


Ser Phe Gly Ala 


Asp Ser Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


545 




550 


555 








560 


Glu 


val 


Glu 


Pro Val Asp Ala 


Arg Pro Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 








565 


570 






575 




Thr 


Arg 


Pro 


Gly Ser Gly Leu 


Thr Asn He Lys Thr 


Glu 


Glu 


He 


Ser 






580 


585 




590 






Glu 


Val 


Asn 


Leu Asp Ala Glu 


Phe Arg His Asp Ser 


Gly 


Tyr 


Glu 


Val 






595 


600 


605 








His 


His 


Gin 


Val Leu Val Phe 


Phe Ala Glu Asp Val 


Gly 


Ser 


Asn 


Lys 




610 




615 


620 










Gly 


Ala 


He 


He Gly Leu Met 


Val Gly Gly Val Val 


He 


Ala 


Thr 


Val 


625 






630 


635 








640 


He 


val 


He 


Thr Leu Val Met 


Leu Lys Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 








645 


650 






655 




Leu 


Gly 


Ser 


Thr Ser Pro Val 


Trp Trp Asn Ser Ala 


Asp 


He 


Leu 


Glu 






660 


665 




670 






Pro 


Val 


Asp 


Pro Arg Leu Glu 


Pro Tip Lys His Pro 


Gly 


Ser 


Gin 


Pro 






675 


680 


685 








Lys 


Thr 


Ala 


Cys Thr Asn Cys 


Tyr Cys Lys Lys Cys 


Cys 


Phe 


His 


Cys 




690 




695 


700 










Gin 


Val 


Cys 


Phe Met Thr Lys 


Ala Leu Gly He Ser 


Tyr 


Gly 


Arg 


Lys 


705 






710 


715 








720 


Lys 


Arg 


Arg 


Gin Arg Arg Arg 


Ala His Gin Asn Ser 


Gin 


Thr 


His 


Gin 






725 


730 






735 




Ala 


Ser 


Leu 


Ser Lys Gin Arg 


He Ser Ser Thr Val 


Ala 


Ala 


Ala 


Asp 








740 


745 




750 






Ala 


Ala 


Val 


Thr Pro Glu Glu 


Arg His Leu Ser Lys 


Met 


Gin 


Gin 


Asn 






755 




760 


765 








Gly 


Tyr 


Glu 


Asn Pro Thr Tyr 


Lys Phe Phe Glu Gin 


Met 


Gin 


Asn 






770 




775 


780 











<210> 3 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 3 
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atgctgcccg 
cccactgatg 
ctgaacatgc 
acctgcattg 
cagatcacca 
ggccgcaagc 
gagtttataa 
atggatgttt 
aagagtacca 
ggggtagagt 
gcggaggagg 
agtgaagaca 
gaagccgatg 
ccctacgaag 
gagtctgtgg 
gacaagtatc 
gagaggcttg 
gcagaacgtc 
caggagaaag 
acacacatgg 
tacatcaccg 
aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
agtccagtgt 
tggaagcatc 
tgctttcatt 
aagcggagac 
aagcagagga 
cacctgtcca 
atgcagaact 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
caggaagtca 
gccaagtttg 
agcgacgaag 
tatccagcac 
agatgcagca 
ag 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gcctaaaact 
tttcatgaca 
agctcatcag 
agtggcggcc 
gaacggctac 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 
ctggagccag 
gcttgtacca 
aaagccttag 
aacagtcaga 
gcagacgccg 
gaaaatccaa 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
tagatcctag 
attgctattg 
gcatctccta 
ctcatcaagc 
ctgtcacccc 
cctacaagtt 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
actagagccc 
taaaaagtgt 
tggcaggaag 
ttctctatca 
agaggagcgc 
ctttgagcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2352 



<210> 4 
<211> 783 
<212> PRT 

<213> fusion protein - hioman 
<400> 4 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Ala Ala 


Trp 


Thr 


Ala 


Arg 


1 








5 










10 








15 




Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 


Ala 


Gly Leu 


Leu 


Ala 


Glu 


Pro 








20 






25 








30 






Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 


Leu 


Asn 


Met His 


Met 


Asn 


Val 


Gin 






35 








40 








45 








Asn 


Gly 
50 


Lys 


Trp 


Asp 


Ser 


Asp 
55 


Pro 


Ser 


Gly 


Thr Lys 
60 


Thr 


Cys 


He 


Asp 


Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


Cys 


Gin 


Glu Val 


Tyr 


Pro 


Glu 


Leu 


65 








70 










75 








80 


Gin 


He 


Thr 


Asn 


Val 
85 


Val 


Glu 


Ala 


Asn 


Gin 
90 


Pro Val 


Thr 


He 


Gin 
95 


Asn 


Trp 


Cys 


Lys 


Arg 
100 


Gly 


Arg 


Lys 


Gin 


Cys 
105 


Lys 


Thr His 


Pro 


His 
110 


Phe 


Val 


He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 


Glu 


Phe 


He Ser 


Asp 


Ala 


Leu 


Leu 
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115 120 125 



Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 


His 


Gin 


Glu 


Arg 


Met 


Asp 


Val 


Cys 




130 










135 










140 










Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


Val 


Ala 


Lys 


Glu 


Thr 


Cys 


Ser 


Glu 


145 










150 










155 










160 


hya 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 


Gly 


Met 


Leu 


Leu 


Pro 


Cys 


Gly 


He 








165 








170 










175 




Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 


Val 


Cys 


Cys 


Pro 


Leu 


Ala 


Glu 


Glu 








180 










185 










190 






Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 


Ala 


Glu 


Glu 


Asp 


Asp 


Ser 


Asp 


Val 






195 










200 










205 








Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thx 


Asp 


Tyr 


Ala 


Asp 


Gly 


Ser 


Glu 


Asp 


Lys 




210 










215 










220 










Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


Val 


Ala 


Glu 


Val 


Glu 


Glu 


Glu 


225 










230 










235 










240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Gly 


Asp 


Glu 


Val 


Glu 


Glu 










245 










250 










255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 


Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


He 








260 








265 










270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 


Ser 


Val 


Glu 


Glu 


Val 


Val 


Arg 






275 










280 










285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 


Asp 


Ala 


Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 










300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


Ala 


His 


Phe 


Gin 


Lys 


Ala 


Lys 


305 










310 










315 










320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 


Arg 


Met 


Ser 


Gin 


Val 


Met 


Arg 








325 










330 










335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 


Lys 


Asn 


Leu 


Pro 


Lys 


Ala 


Asp 








340 










345 










350 






Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 


Gin 


Glu 


Lys 


Val 


Glu 


Ser 


Leu 


Glu 


355 










360 










365 








Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 


Leu 


Val 


Glu 


Thr 


His 


Met 


Ala 




370 










375 










380 










Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


Arg 


Arg 


Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 










395 










400 


Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 


Pro 


Arg 


Pro 


Arg 


His 


Val 


Phe 








405 










410 










415 




Asn 


Ket 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 


Glu 


Gin 


Lys 


Asp 


Arg 


Gin 


His 








420 










425 










430 






Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 


Met 


val 


Asp 


Pro 


Lys 


Lys 


Ala 






435 










440 










445 








Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 


Thr 


His 


Leu 


Arg 


Val 


He 


Tyr 


Glu 




450 








455 










460 










Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 


Leu 


Tyr 


Asn 


val 


Pro 


Ala 


Val 


Ala 


465 










470 










475 










480 


Glu 


Glu 


He 


Gin 


Asp 


Glu 


Val 


Asp 


Glu 


Leu 


Leu 


Gin 


Lys 


Glu 


Gin 


Asn 










485 










490 










495 




Tyr 


Ser 


Asp 


Asp 


Val 


Leu 


Ala 


Asn 


Met 


He 


Ser 


Glu 


Pro 


Arg 


He 


Ser 




500 










505 










510 






Tyr 


Gly 


Asn 


Asp 


Ala 


Leu 


Met 


Pro 


Ser 


Leu 


Thr 


Glu 


Thr 


Lys 


Thr 


Thr 


515 








520 










525 








val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Gly 


Glu 


Phe 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




530 










535 










540 










Pro 


Trp 


His 


Ser 


Phe 


Gly 


Ala 


Asp 


Ser 


Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


545 








550 










555 










560 


Glu 


Val 


Glu 


Pro 


Val 


Asp 


Ala 


Arg 


Pro 


Ala 


Ala 


Asp 


Arg 


Gly 


Leu 


Thr 










565 








570 










575 




rChr 


Arg 


Pro 


Gly 


Ser 


Gly 


Leu 


Thr 


Asn 


He 


Lys 


Thr 


Glu 


Glu 


He 


Ser 






580 










585 










590 






Glu 


Val 


Lys 


Met 


Asp 


Ala 


Glu 


Phe 


Arg 


His 


Asp 


Ser 


Gly 


Tyr 


Glu 


Val 






595 








600 










605 








His 


His 


Gin 


Val 


Leu 


Val 


Phe 


Phe 


Ala 


Glu 


Asp 


Val 


Gly 


Ser 


Asn 


Lys 
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610 


615 




620 










Gly 


Ala 


He He Gly Leu Met 


Val Gly Gly Val 


Val 


He 


Ala 


Thr 


Val 


625 




630 


635 










640 


lie 


Val 


He Thr Leu Val Met 


Leu Lys Lys Lys 


Lys 


Leu 


Gly 


Thr 


Glu 






645 


650 








655 




Leu 


Gly 


Ser Thr Ser Pro Val 


Trp Trp Asn Ser 


Ala 


Asp 


He 


Leu 


Glu 




660 


665 






670 






Pro 


Val 


Asp Pro Arg Leu Glu 


Pro Trp Lys His 


Pro 


Gly 


Ser 


Gin 


Pro 






675 


680 




685 








Lys 


•Thr 


Ala Cys Thr Asn Cys 


Tyr Cys Lys Lys 


Cys 


Cys 


Phe 


His 


Cys 


690 


695 




700 










Gin 


Val 


Cys Phe Met Thr Lys 


Ala Leu Gly He 


Ser 


Tyr 


Gly 


Arg 


Lys 


705 




710 


715 










720 


Lys 


Arg 


Arg Gin Arg Arg Arg 


Ala His Gin Asn 


Ser 


Gin 


Thr 


His 


Gin 






725 


730 








735 




Ala 


Ser 


Leu Ser Lys Gin Arg 


He Ser Ser Thr 


Val 


Ala 


Ala 


Ala 


Asp 






740 


745 






750 






Ala 


Ala 


Val Thr Pro Glu Glu 


Arg His Leu Ser 


Lys 


Met 


Gin 


Gin 


Asn 






755 


760 




765 








Gly 


Tyr 


Glu Asn Pro Thr Tyr 


Lys Phe Phe Glu 


Gin 


Met 


Gin 


Asn 




770 


775 




780 











<210> 5 
<211> 2823 
<212> DNA 

<213> fusion protein 



<400> 5 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 



htaman 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccG 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaatctaga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
cggatccact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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agtccagtgt ggtggaattc tgcagatatc aagctactgt cttctatcga acaagcatgc 2040 
gatatttgcc gacttaaaaa gctcaagtgc tccaaagaaa aaccgaagtg cgccaagtgt 2100 
ctgaagaaca actgggagtg tcgctactct cccaaaacca aaaggtctcc gctgactagg 2160 
gcacatctga cagaagtgga atcaaggcta gaaagactgg aacagctatt tctactgatt 2220 
tttcctcgag aagaccttga catgattttg aaaatggatt ctttacagga tataaaagca 2280 
ttgttaacag gattatttgt acaagataat gtgaataaag atgccgtcac agatagattg 2340 
gcttcagtgg agactgatat gcctctaaca ttgagacagc atagaataag tgcgacatca 2400 
tcatcggaag agagtagtaa caaaggtcaa agacagttga ctgtatcggg aattcccggg 2460 
gatctggccc ccccgaccga tgtcagcctg ggggacgagc tccacttaga cggcgaggac 2520 
gtggcgatgg cgcatgccga cgcgctagac gatttcgatc tggacatgtt gggggacggg 2580 
gattccccgg gtccgggatt taccccccac gactccgccc cctacggcgc tctggatatg 2640 
gccgacttcg agtttgagca gatgtttacc gatgcccttg gaattgacga gtacggtggg 2700 
gatatccagc acagtggcgg ccgcgacgcc gctgtcaccc cagaggagcg ccacctgtcc 2760 
aagatgcagc agaacggcta cgaaaatcca acctacaagt tctttgagca gatgcagaac 2820 
tag 2823 

<210> 6 
<211> 941 
<212> PRT 

<213> fusion protein - human 
<400> 6 

Met Leu Pro 61y Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 

50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe He Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 

210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 
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Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 










325 








Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 








340 










345 


Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 


Gin 






355 










360 




Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 




370 










375 






Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


385 










390 








Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 










405 










Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 








420 










425 


Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 






435 










440 




Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 


Thr 




450 








455 






Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 


Leu 


465 










470 








Glu 


Glu 


He 


Gin 


Asp 


Glu 


Val 


Asp 


Glu 










485 










Tyr 


Ser 


Asp 


Asp 


Val 


Leu 


Ala 


Asn 


Met 








500 










505 


Tyr 


Gly 


Asn 


Asp 


Ala 


Leu 


Met 


Pro 


Ser 






515 










520 




Val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Gly 


Glu 




530 










535 






Pro 


Trp 


His 


Ser 


Phe 


Gly 


Ala 


Asp 


Ser 


545 










550 








Glu 


Val 


Glu 


Pro 


Val 


Asp 


Ala 


Arg 


Pro 










565 










Thr 


Arg 


Pro 


Gly 


Ser 


Gly 


Leu 


Thr 


Asn 








580 










585 


Glu 


Val 


Asn 


Leu 


Asp 


Ala 


Glu 


Phe 


Arg 






595 










600 




His 


His 


Gin 


Val 


Leu 


Val 


Phe 


Phe 


Ala 




610 










615 






Gly 


Ala 


He 


He 


Gly 


Leu 


Met 


Val 


Gly 


625 










630 








He 


Val 


He 


Thr 


Leu 


Val 


Met 


Leu 


Lys 










645 










Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Val 


Trp 


Trp 








660 










665 


Leu 


Ser 


Ser 


He 


Glu 


Gin 


Ala 


Cys 


Asp 






675 










680 




Lys 


Cys 


Ser 


Lys 


Glu 


Lys 


Pro 


Lys 


Cys 




690 










695 






Trp 


Glu 


Cys 


Arg 


Tyr 


Ser 


Pro 


Lys 


Thr 


705 










710 








Ala 


His 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Arg 










725 










Phe 


Leu 


Leu 


He 


Phe 


Pro 


Arg 


Glu 


Asp 








740 










745 


Asp 


Ser 


Leu 


Gin 


Asp 


He 


Lys 


Ala 


Leu 






755 










760 




Asp 


Asn 


Val 


Asn 


Lys 


Asp 


Ala 


Val 


Thr 




770 










775 






Thr 


Asp 


Met 


Pro 


Leu 


Thr 


Leu 


Arg 


Gin 


785 










790 








Ser 


Ser 


Glu 


Glu 


Ser 


Ser 


Asn 


Lys 


Gly 



805 



PCX/ 




Arg Met Ser Gin 


Val Met 


Arg 


330 


335 




Lys Asn Leu Pro 


Lys Ala 


Asp 




350 




Glu Lys Val Glu 


Ser Leu 


Glu 


365 






Leu Val Glu Thr 


His Met 


Ala 


380 






Arg Arg Leu Ala 


Leu Glu 


Asn 


395 




400 


Pro Arg Pro Arg 


His Val 


Phe 


410 


415 




Glu Gin Lys Asp 


Arg Gin 


His 




430 




Met Val Asp Pro 


Lys Lys 


Ala 


445 






His Leu Arg Val 


He Tyr 


Glu 


460 






Tyr Asn Val Pro 


Ala Val 


Ala 


475 




480 


Leu Leu Gin Lys 


Glu Gin 


Asn 


490 


495 




He Ser Glu Pro 


Arg He 


Ser 




510 




Leu Thr Glu Thr 


Lys Thr 


Thr 


525 






Phe Ser Leu Asp 


Asp Leu 


Gin 


540 






Val Pro Ala Asn 


Thr Glu 


Asn 


555 




560 


Ala Ala Asp Arg 


Gly Leu 


Thr 


570 


575 




He Lys Thr Glu 


Glu He 


Ser 


590 




His Asp Ser Gly 


Tyr Glu 


Val 


605 






Glu Asp Val Gly 


Ser Asn 


Lys 


620 






Gly Val Val He 


Ala Thr 


Val 


635 




640 


Lys Lys Lys Leu 


Gly Thr 


Glu 


650 


655 




Asn Ser Ala Asp 


He Lys 


Leu 




670 




He Cys Arg Leu 


Lys Lys 


Leu 


685 






Ala Lys Cys Leu 


Lys Asn 


Asn 


700 






Lys Arg Ser Pro 


Leu Thr 


Arg 


715 




720 


Leu Glu Arg Leu 


Glu Gin 


Leu 


730 


735 




Leu Asp Met He 


Leu Lys 


Met 


750 




Leu Thr Gly Leu 


Phe Val 


Gin 


765 






Asp Arg Leu Ala 


Ser Val 


Glu 


780 






His Arg He Ser 


Ala Thr 


Ser 


795 




800 


Gin Arg Gin Leu 


Thr Val 


Ser 


810 


815 
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Gly He Pro Gly Asp Leu Ala Pro Pro Thr Asp Val Ser Leu Gly Asp 

820 825 830 

Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His Ala Asp Ala 

835 840 845 

Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly 

850 855 860 

Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met 
865 870 875 880 

Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu Gly He Asp 

885 890 895 

Glu Tyr Gly Gly Asp He Gin His Ser Gly Ala Ala Ala Asp Ala Ala 

900 905 910 

Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn Gly Tyr 

915 920 925 

Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
930 935 940 

<210> 7 
<211> 2823 
<212> DNA 

<213> fusion protein - human 
<400> 7 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cc tgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 102 0 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca gaaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caagtattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc aagctactgt cttctatcga acaagcatgc 2040 
gatatttgcc gacttaaaaa gctcaagtgc tccaaagaaa aaccgaagtg cgccaagtgt 2100 
ctgaagaaca actgggagtg tcgctactct cccaaaacca aaaggtctcc gctgactagg 2160 
gcacatctga cagaagtgga atcaaggcta gaaagactgg aacagctatt tctactgatt 2220 
tttcctcgag aagaccttga catgattttg aaaatggatt ctttacagga tataaaagca 2280 
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ttgttaacag gattatttgt acaagataat gtgaataaag atgccgtcac agatagattg 2340 
gcttcagtgg agactgatat gcctctaaca ttgagacagc atagaataag tgcgacatca 2400 
tcatcggaag agagtagtaa caaaggtcaa agacagttga ctgtatcggg aattcccggg 2460 
gatctggccc ccccgaccga tgtcagcctg ggggacgagc tccacttaga cggcgaggac 2520 
gtggcgatgg cgcatgccga cgcgctagac gatttcgatc tggacatgtt gggggacggg 2580 
gattccccgg gtccgggatt taccccccac gactccgccc cctacggcgc tctggatatg 2640 
gccgacttcg agtttgagca gatgtttacc gatgcccttg gaattgacga gtacggtggg 2700 
gatatccagc acagtggcgg ccgcgacgcc gctgtcaccc cagaggagcg ccacctgtcc 2760 
aagatgcagc agaacggcta cgaaaatcca acctacaagt tctttgagca gatgcagaac 2820 
tag 2823 

<210> 8 
<211> 941 
<212> PRT 

<213> fusion protein - hiunan 
<400> 8 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 

50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe He Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 

210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 

325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
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355 360 365 



Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 


Leu 


Val 


Glu 


Thr 


His 


Met 


Ala 




370 










375 










380 










Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


Arg 


Arg 


Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 










395 










400 


Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 


Pro 


Arg 


Pro 


Arg 


His 


Val 


Phe 










405 










410 








415 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 


Glu 


Gin 


Lys 


Asp 


Arg 


Gin 


His 








420 










425 










430 






Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 


Met 


Val 


Asp 


Pro 


Lys 


Lys 


Ala 






435 










440 








445 








Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 


Thr 


His 


Leu 


Arg 


Val 


He 


Tyr 


Glu 




450 








455 










460 










Arg 


Met 


Asn 


Gin 


Ser 


Leu 


Ser 


Leu 


Leu 


Tyr 


Asn 


Val 


Pro 


Ala 


Val 


Ala 


465 










470 








475 










480 


Glu 


Glu 


He 


Gin 


Asp 


Glu 


Val 


Asp 


Glu 


Leu 


Leu 


Gin 


Lys 


Glu 


Gin 


Asn 










485 










490 










495 




Tyr 


Ser 


Asp 


Asp 


Val 


Leu 


Ala 


Asn 


Met 


He 


Ser 


Glu 


Pro 


Arg 


He 


Ser 




500 










505 










510 






Tyr 


Gly 


Asn 


Asp 


Ala 


Leu 


Met 


Pro 


Ser 


Leu 


Thr 


Glu 


Thr 


Lys 


Thr 


Thr 






515 










520 










525 








Val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Gly 


Glu 


Phe 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




530 










535 










540 










Pro 


Trp 


His 


Ser 


Phe 


Gly 


Ala 


Asp 


Ser 


Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


545 








550 










555 










560 


Glu 


Val 


Glu 


Pro 


Val 


Asp 


Ala 


Arg 


Pro 


Ala 


Ala 


Asp 


Arg 


Gly 


Leu 


Thr 










565 










570 










575 




Thr 


Arg 


Pro 


Gly 


Ser 


Gly 


Leu 


Thr 


Asn 


He 


Lys 


Thr 


Glu 


Glu 


He 


Ser 








580 










585 










590 






Glu 


Val 


Lys 


Met 


Asp 


Ala 


Glu 


Phe 


Arg 


His 


Asp 


Ser 


Gly 


Tyr 


Glu 


Val 






595 










600 










605 








His 


His 


Gin 


Val 


Leu 


Val 


Phe 


Phe 


Ala 


Glu 


Asp 


Val 


Gly 


Ser 


Asn 


Lys 




610 










615 










620 










Gly 


Ala 


He 


He 


Gly 


Leu 


Met 


Val 


Gly 


Gly 


Val 


Val 


He 


Ala 


Thr 


Val 


625 










630 










635 










640 


lie 


Val 


He 


Thr 


Leu 


Val 


Met 


Leu 


Lys 


Lys 


Lys 


Lys 


Leu 


Gly 


Thr 


Glu 










645 










650 










655 




Leu 


Gly 


Ser 


Thr 


Ser 


Pro 


Val 


Trp 


Trp 


Asn 


Ser 


Ala 


Asp 


He 


Lys 


Leu 








660 










665 










670 






Leu 


Ser 


Ser 


He 


Glu 


Gin 


Ala 


Cys 


Asp 


He 


Cys 


Arg 


Leu 


Lys 


Lys 


Leu 






675 










680 










685 








Lys 


Cys 


Ser 


Lys 


Glu 


Lys 


Pro 


Lys 


Cys 


Ala 


Lys 


Cys 


Leu 


Lys 


Asn 


Asn 




690 










695 










700 










Trp 


Glu 


Cys 


Arg 


Tyr 


Ser 


Pro 


Lys 


Thr 


Lys 


Arg 


Ser 


Pro 


Leu 


Thr 


Arg 


705 










710 










715 










720 


Ala 


His 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Arg 


Leu 


Glu 


Arg 


Leu 


Glu 


Gin 


Leu 










725 










730 










735 




Phe 


Leu 


Leu 


He 


Phe 


Pro 


Arg 


Glu 


Asp 


Leu 


Asp 


Met 


He 


Leu 


Lys 


Met 








740 








745 










750 






Asp 


Ser 


Leu 


Gin 


Asp 


He 


Lys 


Ala 


Leu 


Leu 


Thr 


Gly 


Leu 


Phe 


Val 


Gin 




755 






760 










765 








Asp 


Asn 


Val 


Asn 


Lys 


Asp 


Ala 


Val 


Thr 


Asp 


Arg 


Leu 


Ala 


Ser 


val 


Glu 


770 






775 










780 










Thr 


Asp 


Met 


Pro 


Leu 


Thr 


Leu 


Arg 


Gin 


His 


Arg 


He 


Ser 


Ala 


Thr 


Ser 


785 










790 








795 










800 


Ser 


Ser 


Glu 


Glu 


Ser 


Ser 


Asn 


Lys 


Gly 


Gin 


Arg 


Gin 


Leu 


Thr 


Val 


Ser 










805 








810 








815 




Gly 


He 


Pro 


Gly 


Asp 


Leu 


Ala 


Pro 


Pro 


Thr 


Asp 


Val 


Ser 


Leu 


Gly 


Asp 






820 








825 










830 






Glu 


Leu 


His 


Leu 


Asp 


Gly 


Glu 


Asp 


val 


Ala 


Met 


Ala 


His 


Ala 


Asp 


Ala 






835 








840 










845 








Leu 


Asp 


Asp 


Phe 


Asp 


Leu 


Asp 


Met 


Leu 


Gly 


Asp 


Gly 


Asp 


Ser 


Pro 


Gly 
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850 


855 


Pro 


Gly 


Phe Thr Pro His Asp Ser 


865 




870 


Ala 


Asp 


Phe Glu Phe Glu Gin Met 






885 


Glu 


Tyr 


Gly Gly Asp lie Gin His 






900 


Val 


Thr 


Pro Glu Glu Arg His Leu 






915 920 


Glu 


Asn 


Pro Thr Tyr Lys Phe Phe 




930 


935 



860 

Ala Pro Tyr Gly Ala Leu Asp Met 
875 880 
Phe Thr Asp Ala Leu Gly lie Asp 

890 895 
Ser Gly Ala Ala Ala Asp Ala Ala 
905 910 
Ser Lys Met Gin Gin Asn Gly Tyr 
925 

Glu Gin Met Gin Asn 
940 



<210> 9 
<211> 2352 
<212> DMA 

<213> fusion protein - human 
<400> 9 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttataa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatttatg agcgcatgaa tcagtctctc 
gaggagattc aggatgaagt tgatgagctg 
gtcttggcca acatgattag tgaaccaagg 
tctttgaccg aaacgaaaac caccgtggag 
gacgatctcc agccgtggca ttcttttggg 
gaagttgagc ctgttgatgc ccgccctgct 
tctgggttga caaatatcaa gacggaggag 
cgacatgact caggatatga agttcatcat 
ggttcaaaca aaggtgcaat cattggactc 
atcgtcatca ccttggtgat gctgaagaag 
agtccagtgt ggtggaattc tgcagatatc 
tggaagcatc caggaagtca gcctaaaact 
tgctttcatt gccaagtttg tttcatgaca 
aagcggagac agcgacgaag agctcatcag 
aagcagagga tatccagcac agtggcggcc 
cacctgtcca agatgcagca gaacggctac 
atgcagaact ag 

<210> 10 
<211> 783 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 1140 
aatgaccgcc gccgcctggc cctggagaac 1200 
cggcctcgtc acgtgttcaa tatgctaaag 1260 
cagcacaccc taaagcattt cgagcatgtg 1320 
atccggtccc aggttatgac acacctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
cttcagaaag agcaaaacta ttcagatgac 1500 
atcagttacg gaaacgatgc tctcatgcca 1560 
ctccttcccg tgaatggaga gttcagcctg 1620 
gctgactctg tgccagccaa cacagaaaac 1680 
gccgaccgag gactgaccac tcgaccaggt 1740 
atctctgaag tgaatctaga tgcagaattc 1800 
caaaaattgg tgttctttgc agaagatgtg 1860 
atggtgggcg gtgttgtcat agcgacagtg 1920 
aaaaagcttg gtaccgagct cggatccact 1980 
ctggagccag tagatcctag actagagccc 2040 
gcttgtacca attgctattg taaaaagtgt 2100 
aaagccttag gcatctccta tggcaggaag 2160 
aacagtcaga ctcatcaagc ttctctatca 2220 
gcagacgccg ctgtcacccc agaggagcgc 2280 
gaaaatccaa cctacaagtt ctttgagcag 2340 

2352 
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<212> PRT 

<213> fusion protein - huxnan 



<400> 10 



Met 


Leu 


Pro 


Gly 


Leu 
5 


Ala 


Leu 


Leu 


Leu 


1 
Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 


Asn 








20 










25 


Gin 


lie 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 


Leu 






35 










40 




Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp 


Pro 


Ser 




50 










55 






Hhr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


Cys 


65 










70 








Gin 


lie 


Thr 


Asn 


val 


Val 


Glu 


Ala 


Asn 










85 










Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 


Cys 








100 










105 


lie 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 


Glu 






115 










120 




Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 


His 




130 










135 






Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


Val 


145 










150 








Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 


Gly 








165 










Asp 


Lys 


Phe 


Arg 


Gly 


val 


Glu 


Phe 


Val 








180 










185 


Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 


Ala 






195 










200 




Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 


Tyr 




210 










215 






Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


Glu 


225 










230 








Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 


Glu 










245 










Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 


Ala 








260 










265 


Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Glu 






275 










280 




Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 


Pro 




290 










295 






Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


His 


305 










310 








Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 


Glu 










325 










Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Ala 






340 










345 


Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 


Gin 






355 










360 




Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 


Gin 




370 










375 






Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


Arg 


385 










390 








Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 


Pro 








405 










Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 


Ala 








420 










425 


Thr 


Leu 


Lys 


His 


Phe 


Glu 


His 


Val 


Arg 






435 










440 




Ala 


Gin 


He 


Arg 


Ser 


Gin 


Val 


Met 


Thr 




450 








455 







PCT/US03/05458 



Leu Ala Ala 


Trp 


Thr 


Ala 


Arg 


10 






15 


Ala Gly Leu 


Leu 


Ala 
30 


Glu 


Pro 


Asn Met His 


Met 
45 


Asn 


Val 


Gin 


Gly Thr Lys 


Thr 


Cys 


He 


Asp 


60 










Gin Glu Val 


Tyr 


Pro 


Glu 


Leu 


75 








80 


Gin Pro Val 


Thr 


He 


Gin 


Asn 


90 






95 




Lys Thr His 


Pro 


His 
110 


Phe 


Val 


Phe He Ser 


Asp 
125 


Ala 


Leu 


Leu 


Gin Glu Arg 


Met 


Asp 


Val 


Cys 


140 










Ala Lys Glu 


Thr 


Cys 


Ser 


Glu 


155 








160 


Met Leu Leu 


Pro 


Cys 


Gly 


He 


170 






175 




Cys Cys Pro 


Leu 


Ala 


Glu 


Glu 




190 






Glu Glu Asp 


Asp 
205 


Ser 


Asp 


Val 


Ala Asp Gly 


Ser 


Glu 


Asp 


Lys 


220 










Val Ala Glu 


val 


Glu 


Glu 


Glu 


235 








240 


Asp Gly Asp 


Glu 


Val 


Glu 


Glu 


250 






255 




Thr Glu Arg 


Thr 


Thr 
270 


Ser 


He 


Ser Val Glu 


Glu 
285 


Val 


Val 


Arg 


Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 


300 










Ala His Phe 


Gin 


Lys 


Ala 


Lys 


315 








320 


Arg Met Ser 


Gin 


Val 


Met 


Arg 


330 






335 




Lys Asn Leu 


Pro 


Lys 
350 


Ala 


Asp 


Glu Lys Val 


Glu 


Ser 


Leu 


Glu 


365 








Leu Val Glu 


Thr 


His 


Met 


Ala 


380 










Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


395 








400 


Pro Arg Pro 


Arg 


His 


Val 


Phe 


410 






415 




Glu Gin Lys 


Asp 


Arg 


Gin 


His 




430 






Met Val Asp 


Pro 
445 


Lys 


Lys 


Ala 


His Leu Arg 


Val 


He 


Tyr 


Glu 



460 



-13- 



21040 



wo 03/072041 ' ' " PCT/US03/05458 ' 



Arg 


Met 


Asn 


Gin Ser Leu Ser Leu Leu 


Tyr Asn Val 


Pro 


Ala 


Val 


Ala 


465 






470 


475 








480 


Glu 


Glu 


He 


Gin Asp Glu Val Asp Glu 


Leu Leu Gin 


Lys 


Glu 


Gin 


Asn 








485 


490 






495 




Tyr 


Ser 


Asp 


Asp Val Leu Ala Asn Met 


He Ser Glu 


Pro 


Arg 


He 


Ser 








500 505 






510 






Tyr 


Gly 


Asn 


Asp Ala Leu Met Pro Ser 


Leu Thr Glu 


Thr 


Lys 


Thr 


Thr 






515 


520 




525 








Val 


Glu 


Leu 


Leu Pro Val Asn Gly Glu 


Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 




530 




535 


540 








Pro 


Trp 


His 


Ser Phe Gly Ala Asp Ser 


Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


545 






550 


555 








560 


Glu 


Val 


Glu 


Pro Val Asp Ala Arg Pro 


Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 








565 


570 






575 




Thr 


Arg 


Pro 


Gly Ser Gly Leu Thr Asn 


He Lys Thr 


Glu 


Glu 


He 


Ser 








580 585 






590 






Glu 


Val 


Asn 


Leu Asp Ala Glu Phe Arg 


His Asp Ser 


Gly 


Tyr 


Glu 


Val 






595 


600 


605 








His 


His 


Gin 


Lys Leu Val Phe Phe Ala 


Glu Asp Val 


Gly 


Ser 


Asn 


Lys 




610 




615 


620 










Gly 


Ala 


He 


He Gly Leu Met Val Gly 


Gly Val Val 


He 


Ala 


Thr 


Val 


625 






630 


635 








640 


lie 


Val 


He 


Thr Leu Val Met Leu Lys 


Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 








645 


650 






655 




Leu 


Gly 


Ser 


Thr Ser Pro Val Trp Trp 


Asn Ser Ala 


Asp 


He 


Leu 


Glu 








660 665 






670 






Pro 


Val 


Asp 


Pro Arg Leu Glu Pro Trp 


Lys His Pro 


Gly 


Ser 


Gin 


Pro 






675 


680 




685 








Lys 


Thr 


Ala 


Cys Thr Asn Cys Tyr Cys 


Lys Lys Cys 


Cys 


Phe 


His 


Cys 




690 




695 


700 










Gin 


Val 


Cys 


Phe Met Thr Lys Ala Leu 


Gly He Ser 


Tyr 


Gly 


Arg 


Lys 


705 






710 


715 








720 


Lys 


Arg 


Arg 


Gin Arg Arg Arg Ala His 


Gin Asn Ser 


Gin 


Thr 


His 


Gin 








725 


730 






735 




Ala 


Ser 


Leu 


Ser Lys Gin Arg He Ser 


Ser Thr Val 


Ala 


Ala 


Ala 


Asp 








740 745 
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<210> 11 
<211> 2352 
<212> DNA 

<213> fusion protein - human 
<400> 11 

atgctgcccg gtttggcact gctcctgctg gccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatgc acatgaatgt ccagaatggg aagtgggatt cagatccatc agggaccaaa 180 
acctgcattg ataccaagga aggcatcctg cagtattgcc aagaagtcta ccctgaactg 240 
cagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgcaagcgg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgattc cctaccgctg cttagttggt 360 
gagtttataa gtgatgccct tctcgttcct gacaagtgca aattcttaca ccaggagagg 420 
atggatgttt gcgaaactca tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtagagt ttgtgtgttg cccactggct gaagaaagtg acaatgtgga ttctgctgat 600 
gcggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 780 
ccctacgaag aagccacaga gagaaccacc agcattgcca ccaccaccac caccaccaca 840 
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gagtctgtgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
gcagaacgtc aagcaaagaa cttgcctaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag tggaatcttt ggaacaggaa gcagccaacg agagacagca gctggtggag 1140 
acacacatgg ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctggagaac 1200 
tacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 1260 
aagtatgtcc gcgcagaaca geiaggacaga cagcacaccc taaagcattt cgagcatgtg 1320 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggttatgac acacctccgt 1380 
gtgatttatg agcgcatgaa tcagtctctc tccctgctct acaacgtgcc tgcagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 1500 
gtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc tctcatgcca 1560 
tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 
gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 
gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 
tctgggttga caaatatcaa gacggaggag atctctgaag tgaagatgga tgcagaattc 1800 
cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 
ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 
atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 
agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 
tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 
tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 
aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 
aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 
cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 
atgcagaact ag 2352 

<210> 12 
<211> 783 
<212> PRT 

<213> fusion protein - hiunan 
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Leu 


Gin 


Lys 


Glu 


Gin 


Asn 


490 










495 




He 


Ser 


Glu 


Pro 


Arg 


He 


Ser 










510 






Leu 


Thr 


Glu 


Thr 


Lys Thr 


Thr 








525 








Phe 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 






540 










Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 




555 










560 


Ala 


Ala 


Asp 


Arg 


Gly Leu 


Thr 


570 








575 




He 


Lys 


Thr 


Glu 


Glu 
con 


He 


Ser 


His 


Asp 


Ser 


Gly 


Tyr Glu 


Val 






605 








Glu 


Asp 


val 


Gly 


Ser Asn 


Lys 




620 










Gly 


Val 


Val 


He 


Ala Thr 


Val 


635 










640 


Lys 


Lys 


Lys 


Leu 


Gly Thr 


Glu 


650 










655 




Asn 


Ser 


Ala 


Asp 


He 


Leu 


Glu 










670 






Lys 


His 


Pro 


Gly 


Ser 


Gin 


Pro 






685 








Lys 


Lys 


Cys 


Cys 


Phe 


His 


Cys 






700 










Gly 


He 


Ser 


Tyr 


Gly Arg 


Lys 


715 










720 


Gin 


Asn 


Ser 


Gin 


Thr His 


Gin 
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725 730 735 

Ala Ser Leu Ser Lys Gin Arg He Ser Sex Thr Val Ala Ala Ala Asp 

740 745 750 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 

755 760 765 

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
770 775 780 



<210> 13 
<211> 2823 
<212> DNA 

<213> fusion protein - hxjman 
<400> 13 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttataa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatttatg agcgcatgaa tcagtctctc 
gaggagattc aggatgaagt tgatgagctg 
gtcttggcca acatgattag tgaaccaagg 
tctttgaccg aaacgaaaac caccgtggag 
gacgatctcc agccgtggca ttcttttggg 
gaagttgagc ctgttgatgc ccgccctgct 
tctgggttga caaatatcaa gacggaggag 
cgacatgact caggatatga agttcatcat 
ggttcaaaca aaggtgcaat cattggactc 
atcgtcatca ccttggtgat gctgaagaag 
agtccagtgt ggtggaattc tgcagatatc 
gatatttgcc gacttaaaaa gctcaagtgc 
ctgaagaaca actgggagtg tcgctactct 
gcacatctga cagaagtgga atcaaggcta 
tttcctcgag aagaccttga catgattttg 
ttgttaacag gattatttgt acaagataat 
gcttcagtgg agactgatat gcctctaaca 
tcatcggaag agagtagtaa caaaggtcaa 
gatctggccc ccccgaccga tgtcagcctg 
gtggcgatgg cgcatgccga cgcgctagac 
gattccccgg gtccgggatt taccccccac 
gccgacttcg agtttgagca gatgtttacc 
gatatccagc acagtggcgg ccgcgacgcc 
aagatgcagc agaacggcta cgaaaatcca 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 1140 
aatgaccgcc gccgcctggc cctggagaac 1200 
cggcctcgtc acgtgttcaa tatgctaaag 1260 
cagcacaccc taaagcattt cgagcatgtg 1320 
atccggtccc aggttatgac acacctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
cttcagaaag agcaaaacta ttcagatgac 1500 
atcagttacg gaaacgatgc tctcatgcca 1560 
ctccttcccg tgaatggaga gttcagcctg 1620 
gctgactctg tgccagccaa cacagaaaac 1680 
gccgaccgag gactgaccac tcgaccaggt 1740 
atctctgeiag tgaatctaga tgcagaattc 1800 
caaaaattgg tgttctttgc agaagatgtg 1860 
atggtgggcg gtgttgtcat agcgacagtg 1920 
aaaaagcttg gtaccgagct cggatccact 1980 
aagctactgt cttctatcga acaagcatgc 2040 
tccaaagaaa aaccgaagtg cgccaagtgt 2100 
cccaaaacca aaaggtctcc gctgactagg 2160 
gaaagactgg aacagctatt tctactgatt 2220 
aaaatggatt ctttacagga tataaaagca 2280 
gtgaataaag atgccgtcac agatagattg 2340 
ttgagacagc atagaataag tgcgacatca 2400 
agacagttga ctgtatcggg aattcccggg 2460 
ggggacgagc tccacttaga cggcgaggac 2520 
gatttcgatc tggacatgtt gggggacggg 2580 
gactccgccc cctacggcgc tctggatatg 2640 
gatgcccttg gaattgacga gtacggtggg 2700 
gctgtcaccc cagaggagcg ccacctgtcc 2760 
acctacaagt tctttgagca gatgcagaac 2820 
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tag 

<210> 14 
<211> 941 
<212> PRT 

<213> fusion protein - human 



<400> 14 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu 


Leu 


1 








5 








Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp 


Gly 








20 








Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly 


Arg 






35 










40 


Asn 


Gly 


Lys 


Tip 


Asp 


Ser 


Asp 


Pro 




50 










55 




Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin 


Tyr 


65 










70 






Gin 


He 


Thr 


Asn 


Val 


Val 


Glu 


Ala 










85 








Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys 


Gin 








100 










He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val 


Gly 






115 










120 


Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe 


Leu 




130 










135 




Glu 


Thr 


His 


Leu 


His 


Trp 


His 


Thr 


145 










150 






Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp 


Tyr 










165 






Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu 


Phe 








180 










Ser 


Asp 


Asn 


Val 


Asp 


Ser 


Ala 


Asp 






195 










200 


Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr 


Asp 




210 










215 




Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu 


Glu 


225 










230 






Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp 


Asp 










245 








Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu 


Glu 








260 








Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 






275 










280 


Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser 


Thr 




290 










295 




Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn 


Glu 


305 








310 






Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His 


Arg 










325 








Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 








340 








Lys 


Lys 


Ala 


Val 


He 


Gin 


His 


Phe 




355 










360 


Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg 


Gin 




370 










375 




Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn 


Asp 


385 










390 






Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala 


Val 










405 








Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val 


Arg 



420 



2823 



Leu Leu Ma Ala 


Trp Thr 


Ala 


Arg 


10 


15 




Asn Ala Gly Leu 


Leu Ala 


Glu 


Pro 


25 


30 






Leu Asn Met His 


Met Asn 


Val 


Gin 




45 






Ser Gly Thr Lys 


Thr Cys 


He 


Asp 


60 








Cys Gin Glu Val 


Tyr Pro 


Glu 


Leu 


75 






80 


Asn Gin Pro Val 


Thr He 


Gin 


Asn 


90 




95 




Cys Lys Thr His 


Pro His 


Phe 


Val 


105 


110 






Glu Phe He Ser 


Asp Ala 


Leu 


Leu 




125 






His Gin Glu Arg 


Met Asp 


Val 


Cys 


140 








Val Ala Lys Glu 


Thr Cys 


Ser 


Glu 


155 




160 


Gly Met Leu Leu 


Pro Cys 


Gly 


He 


170 




175 




Val Cys Cys Pro 


Leu Ala 


Glu 


Glu 


185 


190 






Ala Glu Glu Asp 


Asp Ser 


Asp 


Val 




205 






Tyx Ala Asp Gly 


Ser Glu 


Asp 


Lys 


220 








Glu Val Ala Glu 


Val Glu 


Glu 


Glu 


235 






240 


Glu Asp Gly Asp 


Glu Val 


Glu 


Glu 


250 




255 




Ala Thr Glu Arg 


Thr Thr 


Ser 


He 


265 


270 






Glu Ser Val Glu 


Glu Val 


Val 


Arg 




285 






Pro Asp Ala Val 


Asp Lys 


Tyr 


Leu 


300 








His Ala His Phe 


Gin Lys 


Ala 


Lys 


315 




320 


Glu Arg Met Ser 


Gin Val 


Met 


Arg 


330 




335 




Ala Lys Asn Leu 


Pro Lys 


Ala 


Asp 


345 


350 






Gin Glu Lys Val 


Glu Ser 


Leu 


Glu 




365 






Gin Leu Val Glu 


Thr His 


Met 


Ala 


380 








Arg Arg Arg Leu 


Ala Leu 


Glu 


Asn 


395 






400 


Pro Pro Arg Pro 


Arg His 


Val 


Phe 


410 


415 




Ala Glu Gin Lys 


Asp Arg 


Gin 


His 


425 


430 
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Thr 


Leu 


Lys 


His 


Phe Glu His 


Val Arg 






435 






440 


Ala 


Gin 


He 


Arg 


Ser Gin Val 


Met Thr 




450 






455 




Arg 


Met 


Asn 


Gin 


Ser Leu Ser 


Leu Leu 


465 








470 




Glu 


Glu 


He 


Gin 


Asp Glu Val 


Asp Glu 










485 


Tyr 


Ser 


Asp 


Asp 


Val Leu Ala 


Asn Met 








500 




505 


Tyr 


Gly 


Asn 


Asp 


Ala Leu Met 


Pro Ser 






515 






520 


Val 


Glu 


Leu 


Leu 


Pro Val Asn 


Gly Glu 




530 






535 


Pro 


Trp 


His 


Ser 


Phe Gly Ala 


Asp Ser 


545 








550 




Glu 


Val 


Glu 


Pro 


Val Asp Ala 


Arg Pro 










565 


Thr 


Arg 


Pro 


Gly 


Ser Gly Leu 


Thr Asn 








580 




585 


Glu 


Val 


Asn 


Leu 


Asp Ala Glu 


Phe Arg 






595 






600 


His 


His 


Gin 


Lys 


Leu Val Phe 


Phe Ala 




610 






615 




Gly 


Ala 


He 


He 


Gly Leu Met 


Val Gly 


625 








630 




He 


Val 


He 


Thr 


Leu Val Met 


Leu Lys 










645 


Leu 


Gly 


Ser 


Thr 


Ser Pro Val 


Trp Trp 








660 




665 


Leu 


Ser 


Ser 


He 


Glu Gin Ala 


Cys Asp 






675 






680 


Lys 


Cys 


Ser 


Lys 


Glu Lys Pro 


Lys Cys 




690 






695 




Trp 


Glu 


Cys 


Arg 


Tyr Ser Pro 


Lys Thr 


705 








710 




Ala 


His 


Leu 


Thr 


Glu Val Glu 


Ser Arg 










725 




Phe 


Leu 


Leu 


He 


Phe Pro Arg 


Glu Asp 








740 




745 


Asp 


Ser 


Leu 


Gin 


Asp He Lys 


Ala Leu 






755 






760 


Asp 


Asn 


Val 


Asn 


Lys Asp Ala 


Val Thr 




770 






775 




Thr 


Asp 


Met 


Pro 


Leu Thr Leu 


Arg Gin 


785 








790 




Ser 


Ser 


Glu 


Glu 


Ser Ser Asn 


Lys Gly 










805 




Gly 


He 


Pro 


Gly 


Asp Leu Ala 


Pro Pro 








820 




825 


Glu 


Leu 


His 


Leu 


Asp Gly Glu 


Asp Val 






835 






840 


Leu 


Asp 


Asp 


Phe 


Asp Leu Asp 


Met Leu 




850 






855 




Pro 


Gly 


Phe 


Thr 


Pro His Asp 


Ser Ala 


865 








870 




Ala 


Asp 


Phe 


Glu 


Phe Glu Gin 


Met Phe 










885 




Glu 


Tyr 


Gly 


Gly 


Asp He Gin 


His Ser 








900 




905 


Val 


Thr 


Pro 


Glu 


Glu Arg His 


Leu Ser 






915 




920 
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Met Val Asp 


Pro 


Lys 


Lys 


Ala 




445 








His Leu Arg 


Val 


He 


Tyr 


Glu 


460 










Tyr Asn Val 


Pro 


Ala 


Val 


Ala 


475 








480 


Leu Leu Gin 


Lys 


Glu 


Gin 


Asn 


490 






495 




He Ser Glu 


Pro 


Arg 


He 


Ser 






510 






Leu Thr Glu 


Thr 


Lys 


Thr 


Thr 




525 






Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 


540 










Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


555 








560 


Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 


570 






575 




He Lys Thr 


Glu 


Glu 


He 


Ser 






590 






His Asp Ser 


Gly 


Tyr 


Glu 


Val 




605 








Glu Asp Val 


Gly 


Ser 


Asn 


Lys 


620 










Gly Val Val 


He 


Ala 


Thr 


Val 


635 








640 


Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 


650 






655 




Asn Ser Ala 


Asp 


He 


Lys 


Leu 






670 






He Cys Arg 


Leu 


Lys 


Lys 


Leu 




685 








Ala Lys Cys 


Leu 


Lys 


Asn 


Asn 


700 










Lys Arg Ser 


Pro 


Leu 


Thr 


Arg 


715 








720 


Leu Glu Arg 


Leu 


Glu 


Gin 


Leu 


730 






735 




Leu Asp Met 


He 


Leu 


Lys 


Met 






750 






Leu Thr Gly 


Leu 


Phe 


Val 


Gin 




765 








Asp Arg Leu 


Ala 


Ser 


Val 


Glu 


780 










His Arg He 


Ser 


Ala 


Thr 


Ser 


795 








800 


Gin Arg Gin 


Leu 


Thr 


Val 


Ser 


810 






815 




Thr Asp Val 


Ser 


Leu 


Gly 


Asp 






830 






Ala Met Ala 


His 


Ala 


Asp 


Ala 




845 








Gly Asp Gly 


Asp 


Ser 


Pro 


Gly 


860 










Pro Tyr Gly 


Ala 


Leu 


Asp 


Met 


875 








880 


Thr Asp Ala 


Leu 


Gly 


He 


Asp 


890 






895 




Gly Ala Ala 


Ala 


Asp 


Ala 


Ala 






910 






Lys Met Gin 


Gin 


Asn 


Gly 


Tyr 



925 
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Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
930 935 940 



<210> 15 
<211> 2823 
<212> DNA 

<213> fusion protein - human 



<400> 15 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

gatatttgcc 

ctgaagaaca 

gcacatctga 

tttcctcgag 

ttgttaacag 

gcttcagtgg 

tcatcggaag 

gatctggccc 

gtggcgatgg 

gattccccgg 

gccgacttcg 

gatatccagc 

aagatgcagc 

tag 

<210> 16 
<211> 941 
<212> PRT 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
ceiaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 
gattatttgt 
agactgatat 
agagtagtaa 
ccccgaccga 
cgcatgccga 
gtccgggatt 
agtttgagca 
acagtggcgg 
agaacggcta 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 
acaagataat 
gcctctaaca 
caaaggtcaa 
tgtcagcctg 
cgcgctagac 
taccccccac 
gatgtttacc 
ccgcgacgcc 
cgaaaatcca 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 
gtgaataaag 
ttgagacagc 
agacagttga 
ggggacgagc 
gatttcgatc 
gactccgccc 
gatgcccttg 
gctgtcaccc 
acctacaagt 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaagatgga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
tgcagaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgcgacatca 
aattcccggg 
cggcgaggac 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgcagaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2823 
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<213> fusion protein - human 



<400> 16 



Met 


Leu 


Pro 


Gly Leu Ala 


Leu 


Leu 


1 






5 






Ala 


Leu 


Glu 


Val Pro Thr 


Asp 


Gly 








20 






Gin 


He 


Ala 


Met Phe Cys 


Gly 


Arg 






35 






40 


Asn 


Gly 


Lys 


Trp Asp Ser 


Asp 


Pro 




50 






55 




Thr 


Lys 


Glu 


Gly He Leu 


Gin 


Tyr 


65 




70 






Gin 


He 


Thr 


Asn Val Val 


Glu 


Ala 








85 






Trp 


Cys 


Lys 


Arg Gly Arg 


Lys 


Gin 








100 






He 


Pro 


Tyr 


Arg Cys Leu 


Val 


Gly 






115 






120 


Val 


Pro 


Asp 


Lys Cys Lys 


Phe 


Leu 




130 






135 




Glu 


Thr 


His 


Leu His Trp 


His 


Thr 


145 






150 






Lys 


Ser 


Thr 


Asn Leu His 


Asp 


Tyr 






165 






Asp 


Lys 


Phe 


Arg Gly Val 


Glu 


Phe 








180 






Ser 


Asp 


Asn 


Val Asp Ser 


Ala 


Asp 






195 






200 


Trp 


Trp 


Gly 


Gly Ala Asp 


Thr 


Asp 




210 






215 




Val 


Val 


Glu 


Val Ala Glu 


Glu 


Glu 


225 






230 






Glu 


Ala 


Asp 


Asp Asp Glu 


Asp 


Asp 








245 






Glu 


Ala 


Glu 


Glu Pro Tyr 


Glu 


Glu 








260 






Ala 


Thr 


Thr 


Thr Thr Thr 


Thr 


Thr 






275 






280 


Val 


Pro 


Thr 


Thr Ala Ala 


Ser 


Thr 




290 






295 




Glu 


Thr 


Pro 


Gly Asp Glu 


Asn 


Glu 


305 






310 






Glu 


Arg 


Leu 


Glu Ala Lys 


His 


Arg 






325 






Glu 


Trp 


Glu 


Glu Ala Glu 


Arg 


Gin 






340 






iiys 


Lys 


Ala 


Val He Gin 


His 


Phe 


355 






360 


Gin 


Glu 


Ala 


Ala Asn Glu 


Arg 


Gin 




370 






375 




Arg 


Val 


Glu 


Ala Met Leu 


Asn 


Asp 


385 






390 






Tyr 


He 


Thr 


Ala Leu Gin 


Ala 


Val 






405 






Asn 


Met 


Leu 


Lys Lys Tyr 


Val 


Arg 








420 






Thr 


Leu 


Lys 


His Phe Glu 


His 


Val 






435 






440 


Ala 


Gin 


He 


Arg Ser Gin 


Val 


Met 




450 




455 




Arg 


Met 


Asn 


Gin Ser Leu 


Ser 


Leu 



Leu Leu Ala Ala 


Trp 


Thr 


Ala 


Arg 


10 




15 




Asn Ala Gly Leu 


Leu 


Ala 


Glu 


Pro 


25 




30 






Leu Asn Met His 


Met 


Asn Val 


Gin 




45 








Ser Gly Thr Lys 


Thr 


Cys 


He 


Asp 


60 










Cys Gin Glu Val 


Tyi^ 


Pro 


Glu 


Leu 


75 








80 


Asn Gin Pro Val 


Thr 


He 


Gin 


Asn 


90 






95 




Cys Lys Thr His 


Pro 


His 


Phe 


Val 


105 




110 






Glu Phe He Ser 


Asp 

125 


Ala 


Leu 


Leu 


His Gin Glu Arg 


Met 


Asp Val 


Cys 


140 










Val Ala Lys Glu 


Thr 


Cys 


Ser 


Glu 


155 








160 


Gly Met Leu Leu 


Pro 


Cys Gly 


He 


170 






175 




Val Cys Cys Pro 


Leu 


Ala 


Glu 


Glu 


185 




190 






Ala Glu Glu Asp 


Asp 
205 


Ser 


Asp 


Val 


Tyr Ala Asp Gly 


Ser 


Glu Asp 


Lys 


220 










Glu Val Ala Glu 


Val 


Glu 


Glu 


Glu 


235 








240 


Glu Asp Gly Asp 


Glu 


Val 


Glu 


Glu 


250 






255 




Ala Thr Glu Arg 


Thr 


Thr 


Ser 


He 


265 




270 






Glu Ser Val Glu 


Glu 
285 


Val 


Val 


Arg 


Pro Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 


300 










His Ala His Phe 


Gin 


Lys 


Ala 


Lys 


315 








320 


Glu Arg Met Ser 


Gin 


Val 


Met 


Arg 


330 






335 




Ala Lys Asn Leu 


Pro 


Lys 


Ala 


Asp 


345 




350 






Gin Glu Lys Val 


Glu 


Ser 


Leu 


Glu 


365 








Gin Leu Val Glu 


Thr 


His 


Met 


Ala 


380 










Arg Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


395 








400 


Pro Pro Arg Pro 


Arg 


His Val 


Phe 


410 






415 




Ala Glu Gin Lys 


Asp 


Arg Gin 


His 


425 




430 






Arg Met Val Asp 


Pro 


Lys 


Lys 


Ala 


445 








Thr His Leu Arg 


Val 


He Tyr 


Glu 


460 










Leu Tyr Asn Val 


Pro 


Ala Val 


Ala 
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465 






470 




Glu 


Glu 


He 


Gin Asp Glu Val 


Asp Glu 








485 




Tyr 


Ser 


Asp 


Asp Val Leu Ala 


Asn Met 








500 


505 


Tyr 


Gly 


Asn 


Asp Ala Leu Met 


Pro Ser 






515 




520 


Val 


Glu 


Leu 


Leu Pro Val Asn 


Gly Glu 




530 




535 




Pro 


Trp 


His 


Ser Phe Gly Ala 


Asp Ser 


545 






550 




Glu 


Val 


Glu 


Pro Val Asp Ala 


Arg Pro 








565 




Thr 


Arg 


Pro 


Gly Ser Gly Leu 


Thr Asn 






580 


585 


Glu 


Val 


Lys 


Met Asp Ala Glu 


Phe Arg 






595 




600 


His 


His 


Gin 


Lys Leu Val Phe 


Phe Ala 




610 




615 




Gly 


Ala 


He 


He Gly Leu Met 


Val Gly 


625 






630 




lie 


Val 


He 


Thr Leu Val Met 


Leu Lys 








645 




Leu 


Glv 


Ser 


Thr Ser Pro Val 


Tro Tro 








660 


665 


Leu 


Ser 


Ser 


He Glu Gin Ala 


Cys Asp 






675 




680 


Lvs 


Cys 


Ser 


Lys Glu Lys Pro 


Lys Cys 




690 




695 




Trp 


Glu 


Cys 


Arg Tyr Ser Pro 


Lys Thr 


705 






710 




Ala 


His 


Leu 


Thr Glu Val Glu 


Ser Arg 








725 




Phe 


Leu 


Leu 


He Phe Pro Arg 


Glu Asp 








740 


745 


Asp 


Ser 


Leu 


Gin Asp He Lys 


Ala Leu 






755 




760 


ASD 


Aszi 


Val 


Asn Lys Asp Ala 


Val Thr 




770 




775 




Thr 


Asp 


Met 


Pro Leu Thr Leu 


Arg Gin 


785 




790 




Ser 


Ser 


Glu 


Glu Ser Ser Asn 


Lys Gly 








805 




Gly 


He 


Pro 


Gly Asp Leu Ala 


Pro Pro 






820 


825 


Glu 


Leu 


His 


Leu Asp Gly Glu 


Asp Val 






835 




840 


Leu 


Asp 


Asp 


Phe Asp Leu Asp 


Met Leu 




850 




855 




Pro 


Gly 


Phe 


Thr Pro His Asp 


Ser Ala 


865 






870 




Ala 


Asp 


Phe 


Glu Phe Glu Gin 


Met Phe 








885 




Glu 


Tyr 


Gly 


Gly Asp He Gin 


His Ser 








900 


905 


Val 


Thr 


Pro 


Glu Glu Arg His 


Leu Ser 






915 




920 


Glu 


Asn 


Pro 


Thr Tyr Lys Phe 


Phe Glu 




930 




935 





<210> 17 



P C uSqi 3 / " " 

PCT/US03/05458 



475 






480 


Leu Leu Gin 


Lys 


Glu 


Gin Asn 


490 






495 


He Ser Glu 


Pro 


Arg 


He Ser 






510 




Leu Thr Glu 


Thr 


Lys 


Thr Thr 




525 




Phe Ser Leu 


Asp 


Asp 


Leu Gin 


540 








Val Pro Ala 


Asn 


Thr 


Glu Asn 


555 






560 


Ala Ala Asp 


Arg 


Gly 


Leu Thr 


570 






575 


He Lys Thr 


Glu 


Glu 


He Ser 




590 




His Asp Ser 


Gly 


Tyr 


Glu Val 




605 






Glu Asp Val 


Gly 


Ser 


Asn Lys 


620 








Gly Val Val 


He 


Ala 


Thr Val 


635 






640 


Lys Lys Lys 


Leu 


Gly 


Thr Glu 


650 






655 


Asn Ser Ala 


Asp 


He 


Lys Leu 






670 




He Cys Arg 


Leu 


Lys 


Lys Leu 




685 






Ala Lys Cys 


Leu 


Lys 


Asn Asn 


700 








Lys Arg Ser 


Pro 


Leu 


Thr Arg 


715 






720 


Leu Glu Arg 


Leu 


Glu 


Gin Leu 


730 






735 


Leu Asp Met 


He 


Leu 


Lys Met 






750 




Leu Thr Gly 


Leu 


Phe 


Val Gin 




765 






Asp Arg Leu 


Ala 


Ser 


Val Glu 


780 








His Arg He 


Ser 


Ala 


Thr Ser 


795 






800 


Gin Arg Gin 


Leu 


Thr 


Val Ser 


810 






815 


Thr Asp Val 


Ser 


Leu 


Gly Asp 






830 




Ala Met Ala 


His 


Ala 


Asp Ala 




845 






Gly Asp Gly 


Asp 


Ser 


Pro Gly 


860 








Pro Tyr Gly 


Ala 


Leu 


Asp Met 


875 






880 


Thr Asp Ala 


Leu 


Gly 


He Asp 


890 






895 


Gly Ala Ala 


Ala 


Asp 


Ala Ala 






910 




Lys Met Gin 


Gin 


Asn 


Gly Tyr 




925 






Gin Met Gin 


Asn 







940 
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<211> 3354 
<212> DNA 
<213> hiaman 

<400> 17 

agtttcctcg gcagcggtag gcgagagcac gcggaggagc gtgcgcgggg gccccgggag 60 
acggcggcgg tggcggcgcg ggcagagcaa ggacgcggcg gatcccactc gcacagcagc 120 
gcactcggtg ccccgcgcag ggtcgcgatg ctgcccggtt tggcactgct cctgctggcc 180 
gcctggacgg ctcgggcgct ggaggtaccc actgatggta atgctggcct gctggctgeia 240 
ccccagattg ccatgttctg tggcagactg aacatgcaca tgaatgtcca gaatgggaag 300 
tgggattcag atccatcagg gaccaaaacc tgcattgata ccaaggaagg catcctgcag 360 
tattgccaag aagtctaccc tgaactgcag atcaccaatg tggtagaagc caaccaacca 420 
gtgaccatcc agaactggtg caagcggggc cgcaagcagt gcaagaccca tccccacttt 480 
srtgattccct accgctgctt agttggtgag tttgtaagtg atgcccttct cgttcctgac 540 
aagtgcaaat tcttacacca ggagaggatg gatgtttgcg aaactcatct tcactggcac 600 
accgtcgcca aagagacatg cagtgagaag agtaccaact tgcatgacta cggcatgttg 660 
ctgccctgcg gaattgacaa gttccgaggg gtagagtttg tgtgttgccc actggctgaa 720 
gaaagtgaca atgtggattc tgctgatgcg gaggaggatg actcggatgt ctggtggggc 780 
ggagcagaca cagactatgc agatgggagt gaagacaaag tagtagaagt agcagaggag 840 
gaagaagtgg ctgaggtgga agaagaagaa gccgatgatg acgaggacga tgaggatggt 900 
gatgaggtag aggaagaggc tgaggaaccc tacgaagaag ccacagagag aaccaccagc 960 
attgccacca ccaccaccac caccacagag tctgtggaag aggtggttcg agttcctaca 1020 
acagcagcca gtacccctga tgccgttgac aagtatctcg agacacctgg ggatgagaat 1080 
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagagaatg 1140 
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1200 
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 1260 
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1320 
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1380 
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1440 
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1500 
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1560 
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1620 
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1680 
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1740 
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1800 
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1860 
gaccgaggac tgaccactcg accaggttct gggttgacaa atatcaagac ggaggagatc 1920 
tctgaagtga agatggatgc agaattccga catgactcag gatatgaagt tcatcatcaa 1980 
aaattggtgt tctttgcaga agatgtgggt tcaaacaaag gtgcaatcat tggactcatg 2040 
gtgggcggtg ttgtcatagc gacagtgatc gtcatcacct tggtgatgct gaagaagaaa 2100 
cagtacacat ccattcatca tggtgtggtg gaggttgacg ccgctgtcac cccagaggag 2160 
cgccacctgt ccaagatgca gcagaacggc tacgaaaatc caacctacaa gttctttgag 2220 
cagatgcaga actagacccc cgccacagca gcctctgaag ttggacagca aaaccattgc 2280 
ttcactaccc atcggtgtcc atttatagaa taatgtggga agaaacaaac ccgttttatg 2340 
atttactcat tatcgccttt tgacagctgt gctgtaacac aagtagatgc ctgaacttga 2400 
attaatccac acatcagtaa tgtattctat ctctctttac attttggtct ctatactaca 2460 
ttattaatgg gttttgtgta ctgtaaagaa tttagctgta tcaaactagt gcatgaatag 2520 
attctctcct gattatttat cacatagccc cttagccagt tgtatattat tcttgtggtt 2580 
tgtgacccaa ttaagtccta ctttacatat gctttaagaa tcgatggggg atgcttcatg 2640 
tgaacgtggg agttcagctg cttctcttgc ctaagtattc ctttcctgat cactatgcat 2700 
tttaaagtta aacattttta agtatttcag atgctttaga gagatttttt ttccatgact 2760 
gcattttact gtacagattg ctgcttctgc tatatttgtg atataggaat taagaggata 2820 
cacacgtttg tttcttcgtg cctgttttat gtgcacacat taggcattga gacttcaagc 2880 
ttttcttttt ttgtccacgt atctttgggt ctttgataaa gaaaagaatc cctgttcatt 2940 
gtaagcactt ttacggggcg ggtggggagg ggtgctctgc tggtcttcaa ttaccaagaa 3000 
ttctccaaaa caattttctg caggatgatt gtacagaatc attgcttatg acatgatcgc 3060 
tttctacact gtattacata aataaattaa ataaaataac cccgggcaag acttttcttt 3120 
gaaggatgac tacagacatt aaataatcga agtaattttg ggtggggaga agaggcagat 3180 
tcaattttct ttaaccagtc tgaagtttca tttatgatac aaaagaagat gaaaatggaa 3240 
gtggcaatat aaggggatga ggaaggcatg cctggacaaa cccttctttt aagatgtgtc 3300 
ttcaatttgt ataaaatggt gttttcatgt aaataaatac attcttggag gage 3354 



-23- 



21040 

WO 03/072041 



p C T / y S O 3 . """" " " 

PCTAJS03/05458 



<210> 18 
<211> 695 
<212> PRT 
<213> human 



<400> 18 



Met 


Leu 


Pro 


Gly 


Leu 


Ala 


Leu Leu Leu Leu Ala Ala 


Trp 


Thr 


Ala 


Arg 


1 






5 




10 






15 




Ala 


Leu 


Glu 


Val 


Pro 


Thr 


Asp Gly Asn Ala Gly Leu 


Leu 


Ala 


Glu 


Pro 








20 






25 




30 






Gin 


He 


Ala 


Met 


Phe 


Cys 


Gly Arg Leu Asn Met His 


Met 


Asn 


Val 


Gin 






35 








40 


45 








Asn 


Gly 


Lys 


Trp 


Asp 


Ser 


Asp Pro Ser Gly Thr Lys 


Thr 


Cys 


He 


Asp 




50 










55 60 










Thr 


Lys 


Glu 


Gly 


He 


Leu 


Gin Tyr Cys Gin Glu Val 


Tyr 


Pro 


Glu 


Leu 


65 










70 


75 








80 


Gin 


He 


Thr 


Asn 


Val 


Val 


Glu Ala Asn Gin Pro Val 


Thr 


He 


Gin 


Asn 










85 




90 






95 




Trp 


Cys 


Lys 


Arg 


Gly 


Arg 


Lys Gin Cys Lys Thr His 


Pro 


His 


Phe 


val 








100 






105 




110 






He 


Pro 


Tyr 


Arg 


Cys 


Leu 


Val Gly Glu Phe Val Ser 


Asp 


Ala 


Leu 


Leu 






115 








120 


125 








Val 


Pro 


Asp 


Lys 


Cys 


Lys 


Phe Leu His Gin Glu Arg 


Met 


Asp 


Val 


Cys 




130 










135 140 










Glu 


Thr 


His 


Leu 


His 


Trp 


His Thr Val Ala Lys Glu 


Thr 


Cys 


Ser 


Glu 


145 










150 


155 








160 


Lys 


Ser 


Thr 


Asn 


Leu 


His 


Asp Tyr Gly Met Leu Leu 


Pro 


Cys 


Gly 


He 








165 




170 






175 




Asp 


Lys 


Phe 


Arg 


Gly 


Val 


Glu Phe Val Cys Cys Pro 


Leu 


Ala 


Glu 


Glu 








180 






185 




190 






Ser 


Asp 


Asn 


val 


Asp 


Ser 


Ala Asp Ala Glu Glu Asp 


Asp 


Ser 


Asp 


Val 






195 








200 


205 








Trp 


Trp 


Gly 


Gly 


Ala 


Asp 


Thr Asp Tyr Ala Asp Gly 


Ser 


Glu 


Asp 


Lys 




210 










215 220 










Val 


Val 


Glu 


Val 


Ala 


Glu 


Glu Glu Glu Val Ala Glu 


Val 


Glu 


Glu 


Glu 


225 










230 


235 








240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu 


Asp Asp Glu Asp Gly Asp 


Glu 


Val 


Glu 


Glu 










245 




250 






255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr 


Glu Glu Ala Thr Glu Arg 


Thr 


Thr 


Ser 


He 








260 




265 




270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr Thr Glu Ser Val Glu 


Glu 


Val 


Val 


Arg 






275 








280 


285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala 


Ser Thr Pro Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 




290 










295 300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu 


Asn Glu His Ala His Phe 


Gin 


Lys 


Ala 


Lys 


305 






310 


315 








320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys 


His Arg Glu Arg Met Ser 


Gin 


Val 


Met 


Arg 










325 


330 






335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu 


Arg Gin Ala Lys Asn Leu 


Pro 


Lys 


Ala 


Asp 








340 






345 




350 






Lys 


Lys 


Ala 


Val 


He 


Gin 


His Phe Gin Glu Lys Val 


Glu 


Ser 


Leu 


Glu 




355 








360 


365 








Gin 


Glu 


Ala 


Ala 


Asn 


Glu 


Arg Gin Gin Leu Val Glu 


Thr 


His 


Met 


Ala 




370 










375 380 










Arg 


Val 


Glu 


Ala 


Met 


Leu 


Asn Asp Arg Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 


395 








400 


Tyr 


He 


Thr 


Ala 


Leu 


Gin 


Ala Val Pro Pro Arg Pro 


Arg 


His 


Val 


Phe 








405 




410 






415 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr 


Val Arg Ala Glu Gin Lys 


Asp 


Arg 


Gin 


His 








420 






425 




430 






Thr 


Leu 


I'YS 


His 


Phe 


Glu 


His Val Arg Met Val Asp 


Pro 


Lys 


Lys 


Ala 






435 








440 


445 
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Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 

450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 

485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 

515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Tzp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 

565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 

580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 

610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 

645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 

660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 

675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 

<210> 19 
<211> 3972 
<212> DNA 
<213> htunan 

<400> 19 

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60 
ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120 
ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180 
ctcgtgacca ccttcaccta cggcgtgcag tgcttcgccc gctaccccga ccacatgaag 240 
cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300 
ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360 
gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420 
aagctggagt acaactacaa cagccacaag gtctatatca ccgccgacaa gcagaagaac 480 
ggcatcaagg tgaacttcaa gacccgccac aacatcgagg acggcagcgt gcagctcgcc 540 
gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600 
tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660 
ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaagtaa 720 
ctcgagtcta gagggcccgt ttaaacccgc tgatcagcct cgactgtgcc ttctagttgc 780 
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 840 
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 900 
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 960 
catgctgggg atgcggtggg ctctatggct tctgaggcgg aaagaaccag catgtgagca 1020 
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 1080 
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 1140 
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 1200 
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 1260 
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 1320 
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tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 1380 
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 1440 
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 1500 
tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 1560 
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 1620 
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 1680 
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgacatta 1740 
acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg tgatgacggt 1800 
gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc 1860 
gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt 1920 
aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg 1980 
cacagatgcg taaggagaaa ataccgcatc aggacgcgcc ctgtagcggc gcattaagcg 2040 
cggcgggtgt ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg 2100 
ctcctttcgc tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc 2160 
taaatcgggg gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa 2220 
aacttgatta gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc 2280 
ctttgacgtt ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac 2340 
tcaaccctat ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt 2400 
ggttaaaaaa tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc 2460 
ttacaatttc cattcgccat tcaggctgaa ctagatctag agtccgttac ataacttacg 2520 
gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 2580 
tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 2640 
cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 2700 
gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 2760 
tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 2820 
tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 2880 
cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 2940 
cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 3000 
ataagcagag ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt 3060 
gacctccata gaagacaccg ggaccgatcc agcctccgcg gccgggaacg gtgcattgga 3120 
acggaccgtg ttgacaatta atcatcggca tagtatatcg gcatagtata atacgacaag 3180 
gtgaggaact aaaccatggc caagttgacc agtgccgttc cggtgctcac cgcgcgcgac 3240 
gtcgccggag cggtcgagtt ctggaccgac cggctcgggt tctcccggga cttcgtggag 3300 
gacgacttcg ccggtgtggt ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac 3360 
caggtggtgc cggacaacac cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac 3420 
gccgagtggt cggaggtcgt gtccacgaac ttccgggacg cctccgggcc ggccatgacc 3480 
gagatcggcg agcagccgtg ggggcgggag ttcgccctgc gcgacccggc cggcaactgc 3540 
gtgcacttcg tggccgagga gcaggactga cactcgacct cgaaacttgt ttattgcagc 3600 
ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc 3660 
actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatcga 3720 
tacttcaaga actgctgaca tcgagcttgc tacaagggac tttccgctgg ggactttcca 3780 
gggaggcgtg gcctgggcgg gactggggag tggcgagccc tcagatcctg catataagca 3840 
gctgcttttt gcctgtactg ggtctctctg gttagaccag atctgagcct gggagctctc 3900 
tggctaacta gggaacccac tgcttaagcc tcaataaagc ttggtaccga gctcggatcc 3960 
gaattcgcca cc 3972 

<210> 20 
<211> 3972 
<212> DNA 
<213> hiaman 



taccactcgt tcccgctcct cgacaagtgg ccccaccacg ggtaggacca gctcgacctg 60 
ccgctgcatt tgccggtgtt caagtcgcac aggccgctcc cgctcccgct acggtggatg 120 
ccgttcgact gggacttcaa gtagacgtgg tggccgttcg acgggcacgg gaccgggtgg 180 
gagcactggt ggaagtggat gccgcacgtc acgaagcggg cgatggggct ggtgtacttc 240 
gtcgtgctga agaagttcag gcggtacggg cttccgatgc aggtcctcgc gtggtagaag 300 
aagttcctgc tgccgttgat gttctgggcg cggctccact tcaagctccc gctgtgggac 360 
cacttggcgt agctcgactt cccgtagctg aagttcctcc tgccgttgta ggaccccgtg 420 
ttcgacctca tgttgatgtt gtcggtgttc cagatatagt ggcggctgtt cgtcttcttg 480 
ccgtagttcc acttgaagtt ctgggcggtg ttgtagctcc tgccgtcgca cgtcgagcgg 540 
ctggtgatgg tcgtcttgtg ggggtagccg ctgccggggc acgacgacgg gctgttggtg 600 



<400> 20 
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atggactcgt gggtcaggcg ggactcgttt 
gacgacctca agcactggcg gcggccctag 
gagctcagat ctcccgggca aatttgggcg 
gtcggtagac aacaaacggg gagggggcac 
tgacaggaaa ggattatttt actcctttaa 
taagaccccc caccccaccc cgtcctgtcg 
gtacgacccc tacgccaccc gagataccga 
tttccggtcg ttttccggtc cttggcattt 
gaggcggggg gactgctcgt agtgttttta 
tgtcctgata tttctatggt ccgcaaaggg 
ggctgggacg gcgaatggcc tatggacagg 
agagtatcga gtgcgacatc catagagtca 
acacacgtgc ttggggggca agtcgggctg 
ctcaggttgg gccattctgt gctgaatagc 
tcgtctcgct ccatacatcc gccacgatgt 
atgtgatctt cttgtcataa accatagacg 
tctcaaccat cgagaactag gccgtttgtt 
acgttcgtcg tctaatgcgc gtcttttttt 
tgccccagac tgcgagtcac cttgcttttg 
tggatatttt tatccgcata gtgctccggg 
cttttggaga ctgtgtacgt cgagggcctc 
ccctcgtctg ttcgggcagt cccgcgcagt 
ttgatacgcc gtagtctcgt ctaacatgac 
gtgtctacgc attcctcttt tatggcgtag 
gccgcccaca ccaccaatgc gcgtcgcact 
gaggaaagcg aaagaaggga aggaaagagc 
atttagcccc cgagggaaat cccaaggcta 
ttgaactaat cccactacca agtgcatcac 
gaaactgcaa cctcaggtgc aagaaattat 
agttgggata gagccagata agaaaactaa 
ccaatttttt actcgactaa attgttttta 
aatgttaaag gtaagcggta agtccgactt 
catttaccgg gcggaccgac tggcgggttg 
atacaagggt atcattgcgg ttatccctga 
gccatttgac gggtgaaccg tcatgtagtt 
ctgcagttac tgccatttac cgggcggacc 
aaaggatgaa ccgtcatgta gatgcataat 
accgtcatgt agttacccgc acctatcgcc 
gggtaactgc agttaccctc aaacaaaacc 
gcattgttga ggcggggtaa ctgcgtttac 
tattcgtctc gagcaaatca cttggcagtc 
ctggaggtat cttctgtggc cctggctagg 
tgcctggcac aactgttaat tagtagccgt 
cactccttga tttggtaccg gttcaactgg 
cagcggcctc gccagctcaa gacctggctg 
ctgctgaagc ggccacacca ggccctgctg 
gtccaccacg gcctgttgtg ggaccggacc 
cggctcacca gcctccagca caggtgcttg 
ctctagccgc tcgtcggcac ccccgccctc 
cacgtgaagc accggctcct cgtcctgact 
aatattacca atgtttattt cgttatcgta 
tgacgtaaga tcaacaccaa acaggtttga 
atgaagttct tgacgactgt agctcgaacg 
ccctccgcac cggacccgcc ctgacccctc 
cgacgaaaaa cggacatgac ccagagagac 
accgattgat cccttgggtg acgaattcgg 
cttaagcggt gg 

<210> 21 
<211> 7153 
<212> DNA 
<213> human 



ctggggttgc tcttcgcgct agtgtaccag 660 
tgagagccgt acctgctcga catgttcatt 720 
actagtcgga gctgacacgg aagatcaacg 780 
ggaaggaact gggaccttcc acggtgaggg 840 
cgtagcgtaa cagactcatc cacagtaaga 900 
ttccccctcc taacccttct gttatcgtcc 960 
agactccgcc tttcttggtc gtacactcgt 1020 
ttccggcgca acgaccgcaa aaaggtatcc 1080 
gctgcgagtt cagtctccac cgctttgggc 1140 
ggaccttcga gggagcacgc gagaggacaa 1200 
cggaaagagg gaagcccttc gcaccgcgaa 1260 
agccacatcc agcaagcgag gttcgacccg 1320 
gcgacgcgga ataggccatt gatagcagaa 1380 
ggtgaccgtc gtcggtgacc attgtcctaa 1440 
ctcaagaact tcaccaccgg attgatgccg 1500 
cgagacgact tcggtcaatg gaagcctttt 1560 
tggtggcgac catcgccacc aaaaaaacaa 1620 
cctagagttc ttctaggaaa ctagaaaaga 1680 
agtgcaattc cctaaaacca gtactgtaat 1740 
aaagcagagc gcgcaaagcc actactgcca 1800 
tgccagtgtc gaacagacat tcgcctacgg 1860 
cgcccacaac cgcccacagc cccgaccgaa 1920 
tctcacgtgg tatacgccac actttatggc 1980 
tcctgcgcgg gacatcgccg cgtaattcgc 2040 
ggcgatgtga acggtcgcgg gatcgcgggc 2100 
ggtgcaagcg gccgaaaggg gcagttcgag 2160 
aatcacgaaa tgccgtggag ctggggtttt 2220 
ccggtagcgg gactatctgc caaaaagcgg 2280 
cacctgagaa caaggtttga ccttgttgtg 2340 
atattcccta aaacggctaa agccggataa 2400 
aattgcgctt aaaattgttt tataattgcg 2460 
gatctagatc tcaggcaatg tattgaatgc 2520 
ctgggggcgg gtaactgcag ttattactgc 2580 
aaggtaactg cagttaccca cctcataaat 2640 
cacatagtat acggttcatg cgggggataa 2700 
gtaatacggg tcatgtactg gaataccctg 2760 
cagtagcgat aatggtacca ctacgccaaa 2820 
aaactgagtg cccctaaagg ttcagaggtg 2880 
gtggttttag ttgccctgaa aggttttaca 2940 
ccgccatccg cacatgccac cctccagata 3000 
tagcggacct ctgcggtagg tgcgacaaaa 3060 
tcggaggcgc cggcccttgc cacgtaacct 3120 
atcatatagc cgtatcatat tatgctgttc 3180 
tcacggcaag gccacgagtg gcgcgcgctg 3240 
gccgagccca agagggccct gaagcacctc 3300 
cactgggaca agtagtcgcg ccaggtcctg 3360 
cacacccacg cgccggacct gctcgacatg 3420 
aaggccctgc ggaggcccgg ccggtactgg 3480 
aagcgggacg cgctgggccg gccgttgacg 3540 
gtgagctgga gctttgaaca aataacgtcg 3600 
gtgtttaaag tgtttatttc gtaaaaaaag 3660 
gtagttacat agaatagtac agacctagct 3720 
atgttccctg aaaggcgacc cctgaaaggt 3780 
accgctcggg agtctaggac gtatattcgt 3840 
caatctggtc tagactcgga ccctcgagag 3900 
agttatttcg aaccatggct cgagcctagg 3960 

3972 
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<400> 21 

gacggatcgg gagatctccc gatcccctat 
ccgcatagtt aagccagtat ctgctccctg 
cgagcaaaat ttaagctaca acaaggcaag 
ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gtttaaactt aagcttcccc gcgcagggtc 
ctggccgcct ggacggctcg ggcgctggag 
gctgaacccc agattgccat gttctgtggc 
gggaagtggg attcagatcc atcagggacc 
ctgcagtatt gccaagaagt ctaccctgaa 
caaccagtga ccatccagaa ctggtgcaag 
cactttgtga ttccctaccg ctgcttagtt 
cctgacaagt gcaaattctt acaccaggag 
tggcacaccg tcgccaaaga gacatgcagt 
atgttgctgc cctgcggaat tgacaagttc 
gctgaagaaa gtgacaatgt ggattctgct 
tggggcggag cagacacaga ctatgcagat 
gaggaggaag aagtggctga ggtggaagaa 
gatggtgatg aggtagagga agaggctgag 
accagcattg ccaccaccac caccaccacc 
cctacaacag cagccagtac ccctgatgcc 
gagaatgaac atgcccattt ccagaaagcc 
agaatgtccc aggtcatgag agaatgggaa 
aaagctgata agaaggcagt tatccagcat 
gaagcagcca acgagagaca gcagctggtg 
ctcaatgacc gccgccgcct ggccctggag 
cctcggcctc gtcacgtgtt caatatgcta 
agacagcaca ccctaaagca tttcgagcat 
cagatccggt cccaggttat gacacacctc 
ctctccctgc tctacaacgt gcctgcagtg 
ctgcttcaga aagagcaaaa ctattcagat 
aggatcagtt acggaaacga tgctctcatg 
gagctccttc ccgtgaatgg agagttcagc 
ggggctgact ctgtgccagc caacacagaa 
gctgccgacc gaggactgac cactcgacca 
gagatctctg aagtgaatct agatgcagaa 
catcaaaaat tggtgttctt tgcagaagat 
ctcatggtgg gcggtgttgt catagcgaca 
aagaaagata tcatggagcc agtagatcct 
cagcctaaaa ctgcttgtac caattgctat 
tgtttcatga caaaagcctt aggcatctcc 
agagctcatc agaacagtca gactcatcaa 
cgctcgagtc tagagggccc gtttaaaccc 
gccagccatc tgttgtttgc ccctcccccg 
ccactgtcct ttcctaataa aatgaggaaa 
ctattctggg gggtggggtg gggcaggaca 
ggcatgctgg ggatgcggtg ggctctatgg 
ctagggggta tccccacgcg ccctgtagcg 
cgcgcagcgt gaccgctaca cttgccagcg 
cttcctttct cgccacgttc gccggctttc 



ggtcgactct cagtacaatc tgctctgatg 60 
cttgtgtgtt ggaggtcgct gagtagtgcg 120 
gcttgaccga caattgcatg aagaatctgc 180 
atgtacgggc cagatatacg cgttgacatt 240 
ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 420 
aaactgccca cttggcagta catcaagtgt 480 
tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caeiatgggcg 780 
gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gctggctagc 900 
gcgatgctgc ccggtttggc actgctcctg 960 
gtacccactg atggtaatgc tggcctgctg 1020 
agactgaaca tgcacatgaa tgtccagaat 1080 
aaaacctgca ttgataccaa ggaaggcatc 1140 
ctgcagatca ccaatgtggt agaagccaac 1200 
cggggccgca agcagtgcaa gacccatccc 1260 
ggtgagttta taagtgatgc ccttctcgtt 1320 
aggatggatg tttgcgaaac tcatcttcac 1380 
gagaagagta ccaacttgca tgactacggc 1440 
cgaggggtag agtttgtgtg ttgcccactg 1500 
gatgcggagg aggatgactc ggatgtctgg 1560 
gggagtgaag acaaagtagt agaagtagca 1620 
gaagaagccg atgatgacga ggacgatgag 1680 
gaaccctacg aagaagccac agagagaacc 1740 
acagagtctg tggaagaggt ggttcgagtt 1800 
gttgacaagt atctcgagac acctggggat 1860 
aaagagaggc ttgaggccaa gcaccgagag 1920 
gaggcagaac gtcaagcaaa gaacttgcct 1980 
ttccaggaga aagtggaatc tttggaacag 2040 
gagacacaca tggccagagt ggaagccatg 2100 
aactacatca ccgctctgca ggctgttcct 2160 
aagaagtatg tccgcgcaga acagaaggac 2220 
gtgcgcatgg tggatcccaa gaaagccgct 2280 
cgtgtgattt atgagcgcat gaatcagtct 2340 
gccgaggaga ttcaggatga agttgatgag 2400 
gacgtcttgg ccaacatgat tagtgaacca 2460 
ccatctttga ccgaaacgaa aaccaccgtg 2520 
ctggacgatc tccagccgtg gcattctttt 2580 
aacgaagttg agcctgttga tgcccgccct 2640 
ggttctgggt tgacaaatat caagacggag 2700 
ttccgacatg actcaggata tgaagttcat 2760 
gtgggttcaa acaaaggtgc aatcattgga 2820 
gtgatcgtca tcaccttggt gatgctgaag 2880 
agactagagc cctggaagca tccaggaagt 2940 
tgtaaaaagt gttgctttca ttgccaagtt 3000 
tatggcagga agaagcggag acagcgacga 3060 
gcttctctat caaagcagta agtaggcggc 3120 
gctgatcagc ctcgactgtg ccttctagtt 3180 
tgccttcctt gaccctggaa ggtgccactc 3240 
ttgcatcgca ttgtctgagt aggtgtcatt 3300 
gcaaggggga ggattgggaa gacaatagca 3360 
cttctgaggc ggaaagaacc agctggggct 3420 
gcgcattaag cgcggcgggt gtggtggtta 3480 
ccctagcgcc cgctcctttc gctttcttcc 3540 
cccgtcaagc tctaaatcgg ggcatccctt 3600 



-28- 



21040 



WO 03/072041 



'>CTAJS03/05458 



tagggttccg 
gttcacgtag 
cgttctttaa 
attcttttga 
tttaacaaaa 
gtccccaggc 
ccaggtgtgg 
attagtcagc 
gttccgccca 
ccgcctctgc 
tttgcaaaaa 
caattaatca 
catggccaag 
cgagttctgg 
tgtggtccgg 
caacaccctg 
ggtcgtgtcc 
gccgtggggg 
cgaggagcag 
gttgggcttc 
catgctggag 
aagcaatagc 
tttgtccaaa 
cttggcgtaa 
acacaacata 
actcacatta 
gctgcattaa 
cgcttcctcg 
tcactcaaag 
gtgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
tcgtcttgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
gagattatca 
aatctaaagt 
acctatctca 
gataactacg 
cccacgctca 
cagaagtggt 
tagagtaagt 
cgtggtgtca 
gcgagttaca 
cgttgtcaga 
ttctcttact 
gtcattctga 
taataccgcg 
gcgaaaactc 
acccaactga 
aaggcaaaat 
cttccttttt 
atttgaatgt 
gccacctgac 



atttagtgct 
tgggccatcg 
tagtggactc 
tttataaggg 
atttaacgcg 
tccccaggca 
aaagtcccca 
aaccatagtc 
ttctccgccc 
ctctgagcta 
gctcccggga 
tcggcatagt 
ttgaccagtg 
accgaccggc 
gacgacgtga 
gcctgggtgt 
acgaacttcc 
cgggagttcg 
gactgacacg 
ggaatcgttt 
ttcttcgccc 
atcacaaatt 
ctcatcaatg 
tcatggtcat 
cgagccggaa 
attgcgttgc 
tgaatcggcc 
ctcactgact 
gcggtaatac 
ggccagcaaa 
cgcccccctg 
ggactataaa 
accctgccgc 
caatgctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagga 
gttggtagct 
aagcagcaga 
gggtctgacg 
aaaaggatct 
atatatgagt 
gcgatctgtc 
atacgggagg 
ccggctccag 
cctgcaactt 
agttcgccag 
cgctcgtcgt 
tgatccccca 
agtaagttgg 
gtcatgccat 
gaatagtgta 
ccacatagca 
tcaaggatct 
tcttcagcat 
gccgcaaaaa 
caatattatt 
atttagaaaa 
gtc 



ttacggcacc 
ccctgataga 
ttgttccaaa 
attttgggga 
aattaattct 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 
gcttgtatat 
atatcggcat 
ccgttccggt 
tcgggttctc 
ccctgttcat 
gggtgcgcgg 
gggacgcctc 
ccctgcgcga 
tgctacgaga 
tccgggacgc 
accccaactt 
tcacaaataa 
tatcttatca 
agctgtttcc 
gcataaagtg 
gctcactgcc 
aacgcgcggg 
cgctgcgctc 
ggttatccac 
aggccaggaa 
acgagcatca 
gataccaggc 
ttaccggata 
gctgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
ttacgcgcag 
ctcagtggaa 
tcacctagat 
aaacttggtc 
tatttcgttc 
gcttaccatc 
atttatcagc 
tatccgcctc 
ttaatagttt 
ttggtatggc 
tgttgtgcaa 
ccgcagtgtt 
ccgtaagatg 
tgcggcgacc 
gaactttaaa 
taccgctgtt 
cttttacttt 
agggaataag 
gaagcattta 
ataaacaaat 



tcgaccccaa 
cggtttttcg 
ctggaacaac 
tttcggccta 
gtggaatgtg 
tgcaaagcat 
caggcagaag 
ctccgcccat 
taattttttt 
agtgaggagg 
ccattttcgg 
agtataatac 
gctcaccgcg 
ccgggacttc 
cagcgcggtc 
cctggacgag 
cgggccggcc 
cccggccggc 
tttcgattcc 
cggctggatg 
gtttattgca 
agcatttttt 
tgtctgtata 
tgtgtgaaat 
taaagcctgg 
cgctttccag 
gagaggcggt 
ggtcgttcgg 
agaatcaggg 
ccgtaaaaag 
caaaaatcga 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
ccttttaaat 
tgacagttac 
atccatagtt 
tggccccagt 
aataaaccag 
catccagtct 
gcgcaacgtt 
ttcattcagc 
aaaagcggtt 
atcactcatg 
cttttctgtg 
gagttgctct 
agtgctcatc 
gagatccagt 
caccagcgtt 
ggcgacacgg 
tcagggttat 
aggggttccg 



aaaacttgat 
ccctttgacg 
actcaaccct 
ttggttaaaa 
tgtcagttag 
gcatctcaat 
tatgcaaagc 
cccgccccta 
tatttatgca 
cttttttgga 
atctgatcag 
gacaaggtga 
cgcgacgtcg 
gtggaggacg 
caggaccagg 
ctgtacgccg 
atgaccgaga 
aactgcgtgc 
accgccgcct 
atcctccagc 
gcttataatg 
tcactgcatt 
ccgtcgacct 
tgttatccgc 
ggtgcctaat 
tcgggaaacc 
ttgcgtattg 
ctgcggcgag 
gataacgcag 
gccgcgttgc 
cgctcaagtc 
ggaagctccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 
taaaaatgaa 
caatgcttaa 
gcctgactcc 
gctgcaatga 
ccagccggaa 
attaattgtt 
gttgccattg 
tccggttccc 
agctccttcg 
gttatggcag 
actggtgagt 
tgcccggcgt 
attggaaaac 
tcgatgtaac 
tctgggtgag 
aaatgttgaa 
tgtctcatga 
cgcacatttc 



tagggtgatg 
ttggagtcca 
atctcggtct 
aatgagctga 
ggtgtggaaa 
tagtcagcaa 
atgcatctca 
actccgccca 
gaggccgagg 
ggcctaggct 
cacgtgttga 
ggaactaaac 
ccggagcggt 
acttcgccgg 
tggtgccgga 
agtggtcgga 
tcggcgagca 
acttcgtggc 
tctatgaaag 
gcggggatct 
gttacaaata 
ctagttgtgg 
ctagctagag 
tcacaattcc 
gagtgagcta 
tgtcgtgcca 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 
gttttaaatc 
tcagtgaggc 
ccgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
actcaaccaa 
caatacggga 
gttcttcggg 
ccactcgtgc 
caaaaacagg 
tactcatact 
gcggatacat 
cccgaaaagt 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7153 
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<210> 22 
<211> 7153 
<212> DNA 
<213> human 

<400> 22 

ctgcctagcc ctctagaggg ctaggggata ccagctgaga gtcatgttag acgagactac 60 
ggcgtatcaa ttcggtcata gacgagggac gaacacacaa cctccagcga ctcatcacgc 120 
gctcgtttta aattcgatgt tgttccgttc cgaactggct gttaacgtac ttcttagacg 180 
aatcccaatc cgcaaaacgc gacgaagcgc tacatgcccg gtctatatgc gcaactgtaa 240 
ctaataactg atcaataatt atcattagtt aatgccccag taatcaagta tcgggtatat 300 
acctcaaggc gcaatgtatt gaatgccatt taccgggcgg accgactggc gggttgctgg 360 
gggcgggtaa ctgcagttat tactgcatac aagggtatca ttgcggttat ccctgaaagg 420 
taactgcagt tacccacctg ataaatgcca tttgacgggt gaaccgtcat gtagttcaca 480 
tagtatacgg ttcatgcggg ggataactgc agttactgcc atttaccggg cggaccgtaa 540 
tacgggtcat gtactggaat accctgaaag gatgaaccgt catgtagatg cataatcagt 600 
agcgataatg gtaccactac gccaaaaccg tcatgtagtt acccgcacct atcgccaaac 660 
tgagtgcccc taaaggttca gaggtggggt aactgcagtt accctcaaac aaaaccgtgg 720 
ttttagttgc cctgaaaggt tttacagcat tgttgaggcg gggtaactgc gtttacccgc 780 
catccgcaca tgccaccctc cagatatatt cgtctcgaga gaccgattga tctcttgggt 840 
gacgaatgac cgaatagctt taattatgct gagtgatatc cctctgggtt cgaccgatcg 900 
caaatttgaa ttcgaagggg cgcgtcccag cgctacgacg ggccaaaccg tgacgaggac 960 
gaccggcgga cctgccgagc ccgcgacctc catgggtgac taccattacg accggacgac 1020 
cgacttgggg tctaacggta caagacaccg tctgacttgt acgtgtactt acaggtctta 1080 
cccttcaccc taagtctagg tagtccctgg ttttggacgt aactatggtt ccttccgtag 1140 
gacgtcataa cggttcttca gatgggactt gacgtctagt ggttacacca tcttcggttg 1200 
gttggtcact ggtaggtctt gaccacgttc gccccggcgt tcgtcacgtt ctgggtaggg 1260 
gtgaaacact aagggatggc gacgaatcaa ccactcaaat attcactacg ggaagagcaa 1320 
ggactgttca cgtttaagaa tgtggtcctc tcctacctac aaacgctttg agtagaagtg 1380 
accgtgtggc agcggtttct ctgtacgtca ctcttctcat ggttgaacgt actgatgccg 1440 
tacaacgacg ggacgcctta actgttcaag gctccccatc tcaaacacac aacgggtgac 1500 
cgacttcttt cactgttaca cctaagacga ctacgcctcc tcctactgag cctacagacc 1560 
accccgcctc gtctgtgtct gatacgtcta ccctcacttc tgtttcatca tcttcatcgt 1620 
ctcctccttc ttcaccgact ccaccttctt cttcttcggc tactactgct cctgctactc 1680 
ctaccactac tccatctcct tctccgactc cttgggatgc ttcttcggtg tctctcttgg 1740 
tggtcgtaac ggtggtggtg gtggtggtgg tgtctcagac accttctcca ccaagctcaa 1800 
ggatgttgtc gtcggtcatg gggactacgg caactgttca tagagctctg tggaccccta 1860 
ctcttacttg tacgggtaaa ggtctttcgg tttctctccg aactccggtt cgtggctctc 1920 
tcttacaggg tccagtactc tcttaccctt ctccgtcttg cagttcgttt cttgaacgga 1980 
tttcgactat tcttccgtca ataggtcgta aaggtcctct ttcaccttag aaaccttgtc 2040 
cttcgtcggt tgctctctgt cgtcgaccac ctctgtgtgt accggtctca ccttcggtac 2100 
gagttactgg cggcggcgga ccgggacctc ttgatgtagt ggcgagacgt ccgacaagga 2160 
ggagccggag cagtgcacaa gttatacgat ttcttcatac aggcgcgtct tgtcttcctg 2220 
tctgtcgtgt gggatttcgt aaagctcgta cacgcgtacc acctagggtt ctttcggcga 2280 
gtctaggcca gggtccaata ctgtgtggag gcacactaaa tactcgcgta cttagtcaga 2340 
gagagggacg agatgttgca cggacgtcac cggctcctct aagtcctact tcaactactc 2400 
gacgaagtct ttctcgtttt gataagtcta ctgcagaacc ggttgtacta atcacttggt 2460 
tcctagtcaa tgcctttgct acgagagtac ggtagaaact ggctttgctt ttggtggcac 2520 
ctcgaggaag ggcacttacc tctcaagtcg gacctgctag aggtcggcac cgtaagaaaa 2580 
ccccgactga gacacggtcg gttgtgtctt ttgcttcaac tcggacaact acgggcggga 2640 
cgacggctgg ctcctgactg gtgagctggt ccaagaccca actgtttata gttctgcctc 2700 
ctctagagac ttcacttaga tctacgtctt aaggctgtac tgagtcctat acttcaagta 2760 
gtagttttta accacaagaa acgtcttcta cacccaagtt tgtttccacg ttagtaacct 2820 
gagtaccacc cgccacaaca gtatcgctgt cactagcagt agtggaacca ctacgacttc 2880 
ttctttctat agtacctcgg tcatctagga tctgatctcg ggaccttcgt aggtccttca 2940 
gtcggatttt gacgaacatg gttaacgata acatttttca caacgaaagt aacggttcaa 3000 
acaaagtact gttttcggaa tccgtagagg ataccgtcct tcttcgcctc tgtcgctgct 3060 
tctcgagtag tcttgtcagt ctgagtagtt cgaagagata gtttcgtcat tcatccgccg 3120 
gcgagctcag atctcccggg caaatttggg cgactagtcg gagctgacac ggaagatcaa 3180 
cggtcggtag acaacaaacg gggagggggc acggaaggaa ctgggacctt ccacggtgag 3240 
ggtgacagga aaggattatt ttactccttt aacgtagcgt aacagactca tccacagtaa 3300 
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gataagaccc cccaccccac cccgtcctgt 
ccgtacgacc cctacgccac ccgagatacc 
gatcccccat aggggtgcgc gggacatcgc 
gcgcgtcgca ctggcgatgt gaacggtcgc 
gaaggaaaga gcggtgcaag cggccgaaag 
atcccaaggc taaatcacga aatgccgtgg 
caagtgcatc acccggtagc gggactatct 
gcaagaaatt atcacctgag aacaaggttt 
taagaaaact aaatattccc taaaacccct 
aaattgtttt taaattgcgc ttaattaaga 
caggggtccg aggggtccgt ccgtcttcat 
ggtccacacc tttcaggggt ccgaggggtc 
taatcagtcg ttggtatcag ggcggggatt 
caaggcgggt aagaggcggg gtaccgactg 
ggcggagacg gagactcgat eiaggtcttca 
aaacgttttt cgagggccct cgaacatata 
gttaattagt agccgtatca tatagccgta 
gtaccggttc aactggtcac ggcaaggcca 
gctcaagacc tggctggccg agcccaagag 
acaccaggcc ctgctgcact gggacaagta 
gttgtgggac cggacccaca cccacgcgcc 
ccagcacagg tgcttgaagg ccctgcggag 
cggcaccccc gccctcaagc gggacgcgct 
gctcctcgtc ctgactgtgc acgatgctct 
caacccgaag ccttagcaaa aggccctgcg 
gtacgacctc aagaagcggg tggggttgaa 
ttcgttatcg tagtgtttaa agtgtttatt 
aaacaggttt gagtagttac atagaatagt 
gaaccgcatt agtaccagta tcgacaaagg 
tgtgttgtat gctcggcctt cgtatttcac 
tgagtgtaat taacgcaacg cgagtgacgg 
cgacgtaatt acttagccgg ttgcgcgccc 
gcgaaggagc gagtgactga gcgacgcgag 
agtgagtttc cgccattatg ccaataggtg 
cactcgtttt ccggtcgttt tccggtcctt 
ggtatccgag gcggggggac tgctcgtagt 
tttgggctgt cctgatattt ctatggtccg 
aggacaaggc tgggacggcg aatggcctat 
ccgcgaaaga gttacgagtg cgacatccat 
cgacccgaca cacgtgcttg gggggcaagt 
agcagaactc aggttgggcc attctgtgct 
gtcctaatcg tctcgctcca tacatccgcc 
gatgccgatg tgatcttcct gtcataaacc 
gcctttttct caaccatcga gaactaggcc 
aaaacaaacg ttcgtcgtct aatgcgcgtc 
gaaaagatgc cccagactgc gagtcacctt 
ctctaatagt ttttcctaga agtggatcta 
ttagatttca tatatactca tttgaaccag 
tggatagagt cgctagacag ataaagcaag 
ctattgatgc tatgccctcc cgaatggtag 
gggtgcgagt ggccgaggtc taaatagtcg 
gtcttcacca ggacgttgaa ataggcggag 
atctcattca tcaagcggtc aattatcaaa 
gcaccacagt gcgagcagca aaccataccg 
cgctcaatgt actagggggt acaacacgtt 
gcaacagtct tcattcaacc ggcgtcacaa 
aagagaatga cagtacggta ggcattctac 
cagtaagact cttatcacat acgccgctgg 
attatggcgc ggtgtatcgt cttgaaattt 
cgcttttgag agttcctaga atggcgacaa 
tgggttgact agaagtcgta gaaaatgaaa 
ttccgtttta cggcgttttt tcccttattc 



cgttccccct cctaaccctt ctgttatcgt 3360 
gaagactccg cctttcttgg tcgaccccga 3420 
cgcgtaattc gcgccgccca caccaccaat 3480 
gggatcgcgg gcgaggaaag cgaaagaagg 3540 
gggcagttcg agatttagcc ccgtagggaa 3600 
agctggggtt ttttgaacta atcccactac 3660 
gccaaaaagc gggaaactgc aacctcaggt 3720 
gaccttgttg tgagttggga tagagccaga 3780 
aaagccggat aaccaatttt ttactcgact 3840 
caccttacac acagtcaatc ccacaccttt 3900 
acgtttcgta cgtagagtta atcagtcgtt 3960 
gtccgtcttc atacgtttcg tacgtagagt 4020 
gaggcgggta gggcggggat tgaggcgggt 4080 
attaaaaaaa ataaatacgt ctccggctcc 4140 
tcactcctcc gaaaaaacct ccggatccga 4200 
ggtaaaagcc tagactagtc gtgcacaact 4260 
tcatattatg ctgttccact ccttgatttg 4320 
cgagtggcgc gcgctgcagc ggcctcgcca 4380 
ggccctgaag cacctcctgc tgaagcggcc 4440 
gtcgcgccag gtcctggtcc accacggcct 4500 
ggacctgctc gacatgcggc tcaccagcct 4560 
gcccggccgg tactggctct agccgctcgt 4620 
gggccggccg ttgacgcacg tgaagcaccg 4680 
aaagctaagg tggcggcgga agatactttc 4740 
gccgacctac taggaggtcg cgcccctaga 4800 
caaataacgt cgaatattac caatgtttat 4860 
tcgtaaaaaa agtgacgtaa gatcaacacc 4920 
acagacatat ggcagctgga gatcgatctc 4980 
acacacttta acaataggcg agtgttaagg 5040 
atttcggacc ccacggatta ctcactcgat 5100 
gcgaaaggtc agccctttgg acagcacggt 5160 
ctctccgcca aacgcataac ccgcgagaag 5220 
ccagcaagcc gacgccgctc gccatagtcg 5280 
tcttagtccc ctattgcgtc ctttcttgta 5340 
ggcatttttc cggcgcaacg accgcaaaaa 5400 
gtttttagct gcgagttcag tctccaccgc 5460 
caaaggggga ccttcgaggg agcacgcgag 5520 
ggacaggcgg aaagagggaa gcccttcgca 5580 
agagtcaagc cacatccagc aagcgaggtt 5640 
cgggctggcg acgcggaata ggccattgat 5700 
gaatagcggt gaccgtcgtc ggtgaccatt 5760 
acgatgtctc aagaacttca ccaccggatt 5820 
atagacgcga gacgacttcg gtcaatggaa 5880 
gtttgtttgg tggcgaccat cgccaccaaa 5940 
tttttttcct agagttcttc taggaaacta 6000 
gcttttgagt gcaattccct aaaaccagta 6060 
ggaaaattta atttttactt caaaatttag 6120 
actgtcaatg gttacgaatt agtcactccg 6180 
taggtatcaa cggactgagg ggcagcacat 6240 
accggggtca cgacgttact atggcgctct 6300 
ttatttggtc ggtcggcctt cccggctcgc 6360 
gtaggtcaga taattaacaa cggcccttcg 6420 
cgcgttgcaa caacggtaac gatgtccgta 6480 
aagtaagtcg aggccaaggg ttgctagttc 6540 
ttttcgccaa tcgaggaagc caggaggcta 6600 
tagtgagtac caataccgtc gtgacgtatt 6660 
gaaaagacac tgaccactca tgagttggtt 6720 
ctcaacgaga acgggccgca gttatgccct 6780 
tcacgagtag taaccttttg caagaagccc 6840 
ctctaggtca agctacattg ggtgagcacg 6900 
gtggtcgcaa agacccactc gtttttgtcc 6960 
ccgctgtgcc tttacaactt atgagtatga 7020 
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gaaggaaaaa gttataataa cttcgtaaat agtcccaata acagagtact cgcctatgta 7080 
taaacttaca taaatctttt tatttgttta tccccaaggc gcgtgtaaag gggcttttca 7140 
cggtggactg cag 7153 



<210> 23 
<211> 2352 
<212> DNA 

<213> fusion protein - human 



<400> 23 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

tggaagcatc 

tgctttcatt 

aagcggagac 

aagcagagga 

cacctgtcca 

atgcagaact 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
caggaagtca 
gccaagtttg 
agcgacgaag 
tatccagcac 
agatgcagca 
ag 



<210> 24 
<211> 783 
<212> PRT 

<213> fusion protein 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gcctaaaact 
tttcatgaca 
agctcatcag 
agtggcggcc 
gaacggctac 



human 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 
ctggagccag 
gcttgtacca 
aaagccttag 
aacagtcaga 
gcagacgccg 
gaaaatccaa 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagcceta 
gactgaccac 
tgaactttga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
tagatcctag 
attgctattg 
gcatctccta 
ctcatcaagc 
ctgtcacccc 
cctacaagtt 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
agtggaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
actagagccc 
taaaaagtgt 
tggcaggaag 
ttctctatca 
agaggagcgc 
ctttgagcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2352 



<400> 24 

Met Leu Pro 6ly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 



-32- 



21040 

WO 03/072041 



PCT/U&03 "-a^'^^-D^f 

PCT/US03/05458 



20 



Gin 


lie 


Ala 


Met Phe Cys Gly 


Arg 






35 




40 


Asn 


Gly 


Lys 


Trp Asp Ser Asp 


Pro 




50 




55 




Thr 


Lys 


Glu 


Gly He Leu Gin 


Tyr 


65 






70 




Gin 


lie 


Thr 


Asn Val Val Glu 


Ala 








85 




Trp 


Cys 


Lys 


Arg Gly Arg Lys 


Gin 








100 




lie 


Pro 


Tyr 


Arg Cys Leu Val 


Gly 






115 




120 


Val 


Pro 


Asp 


Lys Cys Lys Phe 


Leu 




130 




135 




Glu 


Thr 


His 


Leu His Trp His 


Thr 


145 






150 




Lys 


Ser 


Thr 


Asn Leu His Asp 


Tyr 






165 




Asp 


Lys 


Phe 


Arg Gly Val Glu 


Phe 








180 




Ser 


Asp 


Asn 


Val Asp Ser Ala 


Asp 






195 




200 


Trp 


Trp 


Gly 


Gly Ala Asp Thr 


Asp 




210 




215 




Val 


Val 


Glu 


Val Ala Glu Glu 


Glu 


225 






230 




Glu 


Ala 


Asp 


Asp Asp Glu Asp 


Asp 








245 




Glu 


Ala 


Glu 


Glu Pro Tyr Glu 


Glu 








260 




Ala 


Thr 


Thr 


Thr Thr Thr Thr 


Thr 






275 




280 


Val 


Pro 


Thr 


Thr Ala Ala Ser 


Thr 




290 




295 




Glu 


Thr 


Pro 


Gly Asp Glu Asn 


Glu 


305 






310 




Glu 


Arg 


Leu 


Glu Ala Lys His 


Arg 








325 




Glu 


Trp 


Glu 


Glu Ala Glu Arg 


Gin 








340 




Lys 


Lys 


Ala 


Val He Gin His 


Phe 




355 




360 


Gin 


Glu 


Ala 


Ala Asn Glu Arg 


Gin 




370 




375 




Arg 


Val 


Glu 


Ala Met Leu Asn 


Asp 


385 






390 




Tyr 


lie 


Thr 


Ala Leu Gin Ala 


Val 






405 




Asn 


Met 


Leu 


Lys Lys Tyr Val 


Arg 








420 




Thr 


Leu 


Lys 


His Phe Glu His 


Val 






435 




440 


Ala 


Gin 


He 


Arg Ser Gin Val 


Met 




450 




455 




Arg 


Met 


Asn 


Gin Ser Leu Ser 


Leu 


465 






470 




Glu 


Glu 


lie 


Gin Asp Glu Val 


Asp 








485 




Tyr 


Ser 


Asp 


Asp Val Leu Ala 


Asn 




500 




Tyr 


Gly 


Asn 


Asp Ala Leu Met 


Pro 



25 






30 




Leu 


Asn Met His 


Met 


Asn 


Val Gin 






45 






Ser 


Gly Thr Lys 


Thr 


Cys 


He Asp 




60 








Cys 


Gin Glu Val 


Tyr 


Pro 


Glu Leu 


75 




80 


Asn 


Gin Pro Val 


Thr 


He 


Gin Asn 




90 






95 


Cys 


Lys Thr His 


Pro 


His 


Phe Val 


105 






110 




Glu 


Phe He Ser 


Asp 


Ala 


Leu Leu 






125 






His 


Gin Glu Arg 


Met 


Asp 


Val Cys 




140 








Val 


Ala Lys Glu 


Thr 


Cys 


Ser Glu 




155 






160 


Gly 


Met Leu Leu 


Pro 


Cys 


Gly He 




170 






175 


Val 


Cys Cys Pro 


Leu 


Ala 


Glu Glu 


185 




190 




Ala 


Glu Glu Asp 


Asp 


Ser 


Asp Val 






205 






Tyr 


Ala Asp Gly 


Ser 


Glu 


Asp Lys 




220 








Glu 


Val Ala Glu 


Val 


Glu 


Glu Glu 




235 






240 


Glu 


Asp Gly Asp 


Glu 


Val 


Glu Glu 




250 






255 


Ala 


Thr Glu Arg 


Thr 


Thr 


Ser He 


265 






270 




Glu 


Ser Val Glu 


Glu 


Val 


Val Arg 






285 






Pro 


Asp Ala Val 


Asp 


Lys 


Tyr Leu 




300 








His 


Ala His Phe 


Gin 


Lys 


Ala Lys 




315 






320 


Glu 


Arg Met Ser 


Gin 


Val 


Met Arg 




330 






335 


Ala 


Lys Asn Leu 


Pro 


Lys 


Ala Asp 


345 






350 




Gin 


Glu Lys Val 


Glu 


Ser 


Leu Glu 






365 






Gin 


Leu Val Glu 


Thr 


His 


Met Ala 




380 








Arg 


Arg Arg Leu 


Ala 


Leu 


Glu Asn 




395 






400 


Pro 


Pro Arg Pro 


Arg 


His 


Val Phe 




410 




415 


Ala 


Glu Gin Lys 


Asp 


Arg 


Gin His 


425 




430 




Arg 


Met Val Asp 


Pro 


Lys 


Lys Ala 






445 






Thr 


His Leu Arg 


Val 


He 


Tyr Glu 




460 








Leu 


Tyr Asn Val 


Pro 


Ala 


Val Ala 




475 






480 


Glu 


Leu Leu Gin 


Lys 


Glu 


Gin Asn 




490 




495 


Met 


He Ser Glu 


Pro 


Arg 


He Ser 


505 






510 




Ser 


Leu Thr Glu 


Thr 


Lys 


Thr Thr 
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515 


520 


Val 


Glu 


Leu Leu Pro Val Asn 


Gly 




530 


535 




Pro 


Trp 


His Ser Phe Gly Ala 


Asp 


545 




550 




61u 


Val 


Glu Pro Val Asp Ala 


Arg 






565 




Thr 


Arg 


Pro Gly Ser Gly Leu 


Thr 






580 




Glu 


Val 


Asn Phe Glu Val Glu 


Phe 






595 


600 


His 


His 


Gin Val Leu Val Phe 


Phe 




610 


615 




Gly 


Ala 


lie lie Gly Leu Met 


Val 


625 




630 




lie 


Val 


lie Thr Leu Val Met 


Leu 






645 




Leu 


Gly 


Ser Thr Ser Pro Val 


Trp 




660 




Pro 


Val 


Asp Pro Arg Leu Glu 


Pro 






675 


680 


Lys 


Thr 


Ala Cys Thr Asn Cys 


Tyr 




690 


695 




Gin 


Val 


Cys Phe Met Thr Lys 


Ala 


705 




710 




Lys 


Arg 


Arg Gin Arg Arg Arg 


Ala 






725 




Ala 


Ser 


Leu Ser Lys Gin Arg 


He 






740 




Ala 


Ala 


Val Thr Pro Glu Glu 


Arg 






755 


760 


Gly 


Tyr 


Glu Asn Pro Thr Tyr 


Lys 


770 


775 







525 








Glu Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 


540 










Ser Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


555 








560 


Pro Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 


570 






575 




Asn He Lys Thr 


Glu 


Glu 


He 


Ser 


585 




590 






Arg His Asp Ser 


Gly 
605 


Tyr 


Glu 


val 


Ala Glu Asp Val 


Gly 


Ser 


Asn 


Lys 


620 










Gly Gly Val Val 


He 


Ala 


Thr 


Val 


635 








640 


Lys Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 


650 






655 




Trp Asn Ser Ala 


Asp 


He 


Leu 


Glu 


665 




670 






Trp Lys His Pro 


Gly 


Ser 


Gin 


Pro 


685 








Cys Lys Lys Cys 


Cys 


Phe 


His 


Cys 


700 










Leu Gly He Ser 


Tyr 


Gly 


Arg 


Lys 


715 








720 


His Gin Asn Ser 


Gin 


Thr 


His 


Gin 


730 






735 




Ser Ser Thr Val 


Ala 


Ala 


Ala 


Asp 


745 




750 






His Leu Ser Lys 


Met 
765 


Gin 


Gin 


Asn 


Phe Phe Glu Gin 


Met 


Gin 


Asn 





780 



<210> 25 
<211> 2823 
<212> DNA 

<213> fusion protein - human 
<400> 25 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttataa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggct 
gcggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 



gccgcctgga cggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct gcggaattga caagttccga 540 
gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgatgagg tagaggaaga ggctgaggaa 780 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccgtt 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcatgagaga atgggaagag 1020 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 1140 
aatgaccgcc gccgcctggc cctggagaac 1200 
cggcctcgtc acgtgttcaa tatgctaaag 1260 
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aagtatgtcc 
cgcatggtgg 
gtgatttatg 
gaggagattc 
gtcttggcca 
.tctttgaccg 
gacgatctcc 
gaagttgagc 
tctgggttga 
cgacatgact 
ggttcaaaca 
atcgtcatca 
agtccagtgt 
gatatttgcc 
ctgaagaaca 
gcacatctga 
tttcctcgag 
ttgttaacag 
gcttcagtgg 
tcatcggaag 
gatctggccc 
gtggcgatgg 
gattccccgg 
gccgacttcg 
gatatccagc 
aagatgcagc 
tag 



gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 
gattatttgt 
agactgatat 
agagtagtaa 
ccccgaccga 
cgcatgccga 
gtccgggatt 
agtttgagca 
acagtggcgg 
agaacggcta 



gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 
acaagataat 
gcctctaaca 
caaaggtcaa 
tgtcagcctg 
cgcgctagac 
taccccccac 
gatgtttacc 
ccgcgacgcc 
cgaaaatcca 



cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caagtattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 
gtgaataaag 
ttgagacagc 
agacagttga 
ggggacgagc 
gatttcgatc 
gactccgccc 
gatgcccttg 
gctgtcaccc 
acctacaagt 



taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaactttga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
agtggaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgcgacatca 
aattcccggg 
cggcgaggac 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgcagaac 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2823 



<210> 26 
<211> 941 
<212> PRT 

<213> fusion protein - human 
<400> 26 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe He Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val 


Val 


Glu 


Val 


Ala 


Glu Glu Glu 


Glu Val Ala Glu 


Val 


Glu 


Glu 


Glu 


225 










230 


235 








240 


Glu 


Ala 


Asp 


Asp 


Asp 


Glu Asp Asp 


Glu Asp Gly Asp 


Glu 


Val 


Glu 


Glu 










245 




250 






255 




Glu 


Ala 


Glu 


Glu 


Pro 


Tyr Glu Glu 


Ala Thr Glu Arg 


Thr 


Thr 


Ser 


He 








260 




265 




270 






Ala 


Thr 


Thr 


Thr 


Thr 


Thr Thr Thr 


Glu Ser Val Glu 


Glu 


Val 


Val 


Arg 






275 






280 




285 








Val 


Pro 


Thr 


Thr 


Ala 


Ala Ser Thr 


Pro Asp Ala Val 


Asp 


Lys 


Tyr 


Leu 




290 








295 


300 










Glu 


Thr 


Pro 


Gly 


Asp 


Glu Asn Glu 


His Ala His Phe 


Gin 


Lys 


Ala 


Lys 


305 








310 


315 








320 


Glu 


Arg 


Leu 


Glu 


Ala 


Lys His Arg 


Glu Arg Met Ser 


Gin 


Val 


Met 


Arg 








325 




330 






335 




Glu 


Trp 


Glu 


Glu 


Ala 


Glu Arg Gin 


Ala Lys Asn Leu 


Pro 


Lys 


Ala 


Asp 






340 






345 




350 






Lys 


Lys 


Ala 


Val 


He 


Gin His Phe 


Gin Glu Lys Val 


Glu 


Ser 


Leu 


Glu 


355 






360 


365 








Gin 


Glu 


Ala 


Ala 


Asn 


Glu Arg Gin 


Gin Leu Val Glu 


Thr 


His 


Met 


Ala 




370 








375 


380 










Arg 


Val 


Glu 


Ala 


Met 


Leu Asn Asp 


Arg Arg Arg Leu 


Ala 


Leu 


Glu 


Asn 


385 










390 


395 








400 


Tyr 


He 


Thr 


Ala 


Leu 


Gin Ala Val 


Pro Pro Arg Pro 


Arg 


His 


Val 


Phe 








405 




410 






415 




Asn 


Met 


Leu 


Lys 


Lys 


Tyr Val Arg 


Ala Glu Gin Lys 


Asp 


Arg 


Gin 


His 








420 






425 




430 






Thr 


Leu 


Lys 


His 


Phe 


Glu His Val 


Arg Met Val Asp 


Pro 


Lys 


Lys 


Ala 






435 






440 




445 








Ala 


Gin 


He 


Arg 


Ser 


Gin Val Met 


Thr His Leu Arg 


Val 


He 


Tyr 


Glu 




450 






455 


460 










Arg 


Met 


Asn 


Gin 


Ser 


Leu Ser Leu 


Leu Tyr Asn Val 


Pro 


Ala 


Val 


Ala 


465 










470 


475 








480 


Glu 


Glu 


He 


Gin 


Asp 


Glu Val Asp 


Glu Leu Leu Gin 


Lys 


Glu 


Gin 


Asn 










485 




490 






495 




Tyr 


Ser 


Asp 


Asp 


Val 


Leu Ala Asn 


Met He Ser Glu 


Pro 


Arg 


He 


Ser 




500 






505 




510 






Tyr 


Gly 


Asn 


Asp 


Ala 


Leu Met Pro 


Ser Leu Thr Glu 


Thr 


Lys 


Thr 


Thr 


515 




520 




525 








Val 


Glu 


Leu 


Leu 


Pro 


Val Asn Gly 


Glu Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 




530 








535 


540 










Pro 


Trp 


His 


Ser 


Phe 


Gly Ala Asp 


Ser Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


545 








550 


555 








560 


Glu 


Val 


Glu 


Pro 


Val 


Asp Ala Arg 


Pro Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 










565 




570 






575 




Thr 


Arg 


Pro 


Gly 


Ser 


Gly Leu Thr 


Asn He Lys Thr 


Glu 


Glu 


He 


Ser 






580 






585 




590 






Glu 


Val 


Asn 


Phe 


Glu 


Val Glu Phe 


Arg His Asp Ser 


Gly 


Tyr 


Glu 


Val 






595 






600 


605 








His 


His 


Gin 


Val 


Leu 


Val Phe Phe 


Ala Glu Asp Val 


Gly 


Ser 


Asn 


Lys 




610 








615 


620 










Gly 


Ala 


He 


He 


Gly 


Leu Met Val 


Gly Gly Val Val 


He 


Ala 


Thr 


Val 


625 










630 


635 








640 


He 


Val 


He 


Thr 


Leu 


Val Met Leu 


Lys Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 










645 




650 






655 




Leu 


Gly 


Ser 


Thr 


Ser 


Pro Val Trp 


Trp Asn Ser Ala 


Asp 


He 


Lys 


Leu 






660 






665 




670 






Leu 


Ser 


Ser 


He 


Glu 


Gin Ala Cys 


Asp He Cys Arg 


Leu 


Lys 


Lys 


Leu 






675 






680 




685 








Lys 


Cys 


Ser 


Lys 


Glu 


Lys Pro Lys 


Cys Ala Lys Cys 


Leu 


Lys 


Asn 


Asn 


690 






695 


700 










Trp 


Glu 


Cys 


Arg 


Tyr 


Ser Pro Lys 


Thr Lys Arg Ser 


Pro 


Leu 


Thr 


Arg 


705 






710 


715 








720 
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Ala 


His 


Leu Thr Glu Val Glu Ser Arg Leu Glu 


Arg 


Leu 


Glu 


Gin 


Leu 






725 730 








735 




Phe 


Leu 


Leu lie Phe Pro Arg Glu Asp Leu Asp 


Met 


He 


Leu 


Lys 


Met 






740 745 






750 






Asp 


Ser 


Leu Gin Asp lie Lys Ala Leu Leu Thr 


Gly 


Leu 


Phe 


Val 


Gin 




755 760 




765 








Asp 


Asn 


Val Asn Lys Asp Ala Val Thr Asp Arg 


Leu 


Ala 


Ser 


Val 


Glu 


770 


775 


780 










Thr 


Asp 


Met Pro Leu Thr Leu Arg Gin His Arg 


He 


Ser 


Ala 


Thr 


Ser 


785 


790 795 










800 


Ser 


Ser 


Glu Glu Ser Ser Asn Lys Gly Gin Arg 


Gin 


Leu 


Thr 


Val 


Ser 






805 810 








815 




Gly 


lie 


Pro Gly Asp Leu Ala Pro Pro Thr Asp 


Val 


Ser 


Leu 


Gly 


Asp 




820 825 






830 






61u 


Leu 


His Leu Asp Gly Glu Asp Val Ala Met 


Ala 


His 


Ala 


Asp 


Ala 






835 840 




845 








Leu 


Asp 


Asp Phe Asp Leu Asp Met Leu Gly Asp 


Gly 


Asp 


Ser 


Pro 


Gly 




850 


855 


860 










Pro 


Gly 


Phe Thr Pro His Asp Ser Ala Pro Tyr 


Gly 


Ala 


Leu 


Asp 


Met 


865 


870 875 










880 


Ala 


Asp 


Phe Glu Phe Glu Gin Met Phe Thr Asp 


Ala 


Leu 


Gly 


He 


Asp 




885 890 








895 




Glu 


Tyr 


Gly Gly Asp He Gin His Ser Gly Ala 


Ala 


Ala 


Asp 


Ala 


Ala 




900 905 






910 






Val 


Thr 


Pro Glu Glu Arg His Leu Ser Lys Met 


Gin 


Gin 


Asn 


Gly 


Tyr 






915 920 




925 








Glu 


Asn 


Pro Thr Tyr Lys Phe Phe Glu Gin Met 


Gin 


Asn 










930 


935 


940 











<210> 27 
<211> 5015 
<212> DNA 
<213> human 

<400> 27 
gacggatcgg 
ccgcatagtt 
cgagcaaaat 
ttagggttag 
gattattgac 
tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 
ctgcttactg 
gtttaaactt 
agatatccag 
gcctcgactg 
ttgaccctgg 
cattgtctga 
gaggattggg 
gcggaaagaa 
agcgcggcgg 
cccgctcctt 
gctctaaatc 
aaaaaacttg 



gagatctccc 
aagccagtat 
ttaagctaca 
gcgttttgcg 
tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
aagcttggta 
cacagtggcg 
tgccttctag 
aaggtgccac 
gtaggtgtca 
aagacaatag 
ccagctgggg 
gtgtggtggt 
tcgctttctt 
ggggcatccc 
attagggtga 



gatcccctat 
ctgctccctg 
acaaggcaag 
ctgcttcgcg 
tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
ccgagctcgg 
gccgctcgag 
ttgccagcca 
tcccactgtc 
ttctattctg 
caggcatgct 
ctctaggggg 
tacgcgcagc 
cccttccttt 
tttagggttc 
tggttcacgt 



ggtcgactct 
cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
atccactagt 
tctagagggc 
tctgttgttt 
ctttcctaat 
gggggtgggg 

ggggatgcgg 
tatccccacg 
gtgaccgcta 
ctcgccacgt 
cgatttagtg 
agtgggccat 



cagtacaatc 
ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
ccagtgtggt 
ccgtttaaac 
gcccctcccc 
aaaatgagga 
tggggcagga 
tgggctctat 
cgccctgtag 
cacttgccag 
tcgccggctt 
ctttacggca 
cgccctgata 



tgctctgatg 
gagtagtgcg 
aagaatctgc 
cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctggctagc 
ggaattctgc 
ccgctgatca 
cgtgccttcc 
aattgcatcg 
cagcaagggg 
ggcttctgag 
cggcgcatta 
cgccctagcg 
tccccgtcaa 
cctcgacccc 
gacggttttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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cgccctttga 
acactcaacc 
tattggttaa 
tgtgtcagtt 
atgcatctca 
agtatgcaaa 
atCGcgcccc 
tttatttatg 
ggcttttttg 
ggatctgatc 
acgacaaggt 
cgcgcgacgt 
tcgtggagga 
tccaggacca 
agctgtacgc 
ccatgaccga 
gcaactgcgt 
ccaccgccgc 
tgatcctcca 
cagcttataa 
tttcactgca 
taccgtcgac 
attgttatcc 
ggggtgccta 
agtcgggaaa 
gtttgcgtat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 
gctgcgcctt 
cactggcagc 
agttcttgaa 
ctctgctgaa 
ccaccgctgg 
gatctcaaga 
cacgttaagg 
attaaaaatg 
accaatgctt 
ttgcctgact 
gtgctgcaat 
agccagccgg 
ctattaattg 
ttgttgccat 
gctccggttc 
ttagctcctt 
tggttatggc 
tgactggtga 
cttgcccggc 
tcattggaaa 
gttcgatgta 
tttctgggtg 
ggaaatgttg 
attgtctcat 
cgcgcacatt 



cgttggagtc 
ctatctcggt 
aaaatgagct 
agggtgtgga 
attagtcagc 
gcatgcatct 
taactccgcc 
cagaggccga 
gaggcctagg 
agcacgtgtt 
gaggaactaa 
cgccggagcg 
cgacttcgcc 
ggtggtgccg 
cgagtggtcg 
gatcggcgag 
gcacttcgtg 
cttctatgaa 
gcgcggggat 
tggttacaaa 
ttctagttgt 
ctctagctag 
gctcacaatt 
atgagtgagc 
cctgtcgtgc 
tgggcgctct 
agcggtatca 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 
cgttcgctcc 
atccggtaac 
agccactggt 
gtggtggcct 
gccagttacc 
tagcggtggt 
agatcctttg 
gattttggtc 
aagttttaaa 
aatcagtgag 
ccccgtcgtg 
gataccgcga 
aagggccgag 
ttgccgggaa 
tgctacaggc 
ccaacgatca 
cggtcctccg 
agcactgcat 
gtactcaacc 
gtcaatacgg 
acgttcttcg 
acccactcgt 
agcaaaaaca 
aatactcata 
gagcggatac 
tccccgaaaa 



cacgttcttt 
ctattctttt 
gatttaacaa 
aagtccccag 
aaccaggtgt 
caattagtca 
cagttccgcG 
ggccgcctct 
cttttgcaaa 
gacaattaat 
accatggcca 
gtcgagttct 
ggtgtggtcc 
gacaacaccc 
gaggtcgtgt 
cagccgtggg 
gccgaggagc 
aggttgggct 
ctcatgctgg 
taaagcaata 
ggtttgtcca 
agcttggcgt 
ccacacaaca 
taactcacat 
cagctgcatt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 
aagctgggct 
tatcgtcttg 
aacaggatta 
aactacggct 
ttcggaaaaa 
ttttttgttt 
atcttttcta 
atgagattat 
tcaatctaaa 
gcacctatct 
tagataacta 
gacccacgct 
cgcagaagtg 
gctagagtaa 
atcgtggtgt 
aggcgagtta 
atcgttgtca 
aattctctta 
aagtcattct 
gataataccg 
gggcgaaaac 
gcacccaact 
ggaaggcaaa 
ctcttccttt 
atatttgaat 
gtgccacctg 



aatagtggac 
gatttataag 
aaatttaacg 
gctccccagg 
ggaaagtccc 
gcaaccatag 
cattctccgc 
gcctctgagc 
aagctcccgg 
catcggcata 
agttgaccag 
ggaccgaccg 
gggacgacgt 
tggcctgggt 
ccacgaactt 
ggcgggagtt 
aggactgaca 
tcggaatcgt 
agttcttcgc 
gcatcacaaa 
aactcatcaa 
aatcatggtc 
tacgagccgg 
taattgcgtt 
aatgaatcgg 
cgctcactga 
aggcggtaat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcaatgctc 
gtgtgcacga 
agtccaaccc 
gcagagcgag 
acactagaag 
gagttggtag 
gcaagcagca 
cggggtctga 
caaaaaggat 
gtatatatga 
cagcgatctg 
cgatacggga 
caccggctcc 
gtcctgcaac 
gtagttcgcc 
cacgctcgtc 
catgatcccc 
gaagtaagtt 
ctgtcatgcc 
gagaatagtg 
cgccacatag 
tctcaaggat 
gatcttcagc 
atgccgcaaa 
ttcaatatta 
gtatttagaa 
acgtc 



tcttgttcca 
ggattttggg 
cgaattaatt 
caggcagaag 
caggctcccc 
tcccgcccct 
cccatggctg 
tattccagaa 
gagcttgtat 
gtatatcggc 
tgccgttccg 
gctcgggttc 
gaccctgttc 
gtgggtgcgc 
ccgggacgcc 
cgccctgcgc 
cgtgctacga 
tttccgggac 
ccaccccaac 
tttcacaaat 
tgtatcttat 
atagctgttt 
aagcataaag 
gcgctcactg 
ccaacgcgcg 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 
accccccgtt 
ggtaagacac 
gtatgtaggc 
gacagtattt 
ctcttgatcc 
gattacgcgc 
cgctcagtgg 
cttcacctag 
gtaaacttgg 
tctatttcgt 
gggcttacca 
agatttatca 
tttatccgcc 
agttaatagt 
gtttggtatg 
catgttgtgc 
ggccgcagtg 
atccgtaaga 
tatgcggcga 
cagaacttta 
cttaccgctg 
atcttttact 
aaagggaata 
ttgaagcatt 
aaataaacaa 



aactggaaca 
gatttcggcc 
ctgtggaatg 
tatgcaaagc 
agcaggcaga 
aactccgccc 
actaattttt 
gtagtgagga 
atccattttc 
atagtataat 
gtgctcaccg 
tcccgggact 
atcagcgcgg 
ggcctggacg 
tccgggccgg 
gacccggccg 
gatttcgatt 
gccggctgga 
ttgtttattg 
aaagcatttt 
catgtctgta 
cctgtgtgaa 
tgtaaagcct 
cccgctttcc 
gggagaggcg 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
tatctcagtt 
cagcccgacc 
gacttatcgc 
ggtgctacag 
ggtatctgcg 
ggcaaacaaa 
agaaaaaaag 
aacgaaaact 
atccttttaa 
tctgacagtt 
tcatccatag 
tctggcccca 
gcaataaacc 
tccatccagt 
ttgcgcaacg 
gcttcattca 
aaaaaagcgg 
ttatcactca 
tgcttttctg 
ccgagttgct 
aaagtgctca 
ttgagatcca 
ttcaccagcg 
agggcgacac 
tatcagggtt 
ataggggttc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5015 
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<211> 5015 
<212> DKIA 
<213> human 

<400> 28 
ctgcctagcc 
ggcgtatcaa 
gctcgtttta 
aatcccaatc 
ctaataactg 
acctcaaggc 
gggcgggtaa 
taactgcagt 
tagtatacgg 
tacgggtcat 
agcgataatg 
tgagtgcccc 
ttttagttgc 
catccgcaca 
gacgaatgac 
caaatttgaa 
tctataggtc 
cggagctgac 
aactgggacc 
gtaacagact 
ctcctaaccc 
cgcctttctt 
tcgcgccgcc 
gggcgaggaa 
cgagatttag 
ttttttgaac 
gcgggaaact 
tgtgagttgg 
ataaccaatt 
acacagtcaa 
tacgtagagt 
tcatacgttt 
tagggcgggg 
aaataaatac 
ccgaaaaaac 
cctagactag 
tgctgttcca 
gcgcgctgca 
agcacctcct 
aggtcctggt 
tcgacatgcg 
ggtactggct 
cgttgacgca 
ggtggcggcg 
actaggaggt 
gtcgaatatt 
aaagtgacgt 
atggcagctg 
taacaatagg 
ccccacggat 
tcagcccttt 
caaacgcata 
ccgacgccgc 
ccctattgcg 
tccggcgcaa 
ctgcgagttc 
gaccttcgag 



ctctagaggg 
ttcggtcata 
aattcgatgt 
cgcaaaacgc 
atcaataatt 
gcaatgtatt 
ctgcagttat 
tacccacctg 
ttcatgcggg 
gtactggaat 
gtaccactac 
taaaggttca 
cctgaaaggt 
tgccaccctc 
cgaatagctt 
ttcgaaccat 
gtgtcaccgc 
acggaagatc 
ttccacggtg 
catccacagt 
ttctgttatc 
ggtcgacccc 
cacaccacca 
agcgaaagaa 
ccccgtaggg 
taatcccact 
gcaacctcag 
gatagagcca 
ttttactcga 
tcccacacct 
taatcagtcg 
cgtacgtaga 
attgaggcgg 
gtctccggct 
ctccggatcc 
tcgtgcacaa 
ctccttgatt 
gcggcctcgc 
gctgaagcgg 
ccaccacggc 
gctcaccagc 
ctagccgctc 
cgtgaagcac 
gaagatactt 
cgcgccccta 
accaatgttt 
aagatcaaca 
gagatcgatc 
cgagtgttaa 
tactcactcg 
ggacagcacg 
acccgcgaga 
tcgccatagt 
tcctttcttg 
cgaccgcaaa 
agtctccacc 
ggagcacgcg 



ctaggggata 
gacgagggac 
tgttccgttc 
gacgaagcgc 
atcattagtt 
gaatgccatt 
tactgcatac 
ataaatgcca 
ggataactgc 
accctgaaag 
gccaaaaccg 
gaggtggggt 
tttacagcat 
cagatatatt 
taattatgct 
ggctcgagcc 
cggcgagctc 
aacggtcggt 
agggtgacag 
aagataagac 
gtccgtacga 
gagatccccc 
atgcgcgtcg 
gggaaggaaa 
aaatcccaag 
accaagtgca 
gtgcaagaaa 
gataagaaaa 
ctaaattgtt 
ttcaggggtc 
ttggtccaca 
gttaatcagt 
gtceiaggcgg 
ccggcggaga 
gaaaacgttt 
ctgttaatta 
tggtaccggt 
cagctcaaga 
ccacaccagg 
ctgttgtggg 
ctccagcaca 
gtcggcaccc 
cggctcctcg 
tccaacccga 
gagtacgacc 
atttcgttat 
ccaaacaggt 
tcgaaccgca 
ggtgtgttgt 
attgagtgta 
gtcgacgtaa 
aggcgaagga 
cgagtgagtt 
tacactcgtt 
aaggtatccg 
gctttgggct 
agaggacaag 



ccagctgaga 
gaacacacaa 
cgaactggct 
tacatgcccg 
aatgccccag 
taccgggcgg 
aagggtatca 
tttgacgggt 
agttactgcc 
gatgaaccgt 
tcatgtagtt 
aactgcagtt 
tgttgaggcg 
cgtctcgaga 
gagtgatatc 
taggtgatca 
agatctcccg 
agacaacaaa 
gaaaggatta 
cccccacccc 
cccctacgcc 
ataggggtgc 
cactggcgat 
gagcggtgca 
gctaaatcac 
tcacccggta 
ttatcacctg 
ctaaatattc 
tttaaattgc 
cgaggggtcc 
cctttcaggg 
cgttggtatc 
gtaagaggcg 
cggagactcg 
ttcgagggcc 
gtagccgtat 
tcaactggtc 
cctggctggc 
ccctgctgca 
accggaccca 
ggtgcttgaa 
ccgccctcaa 
tcctgactgt 
agccttagca 
tcaagaagcg 
cgtagtgttt 
ttgagtagtt 
ttagtaccag 
atgctcggcc 
attaacgcaa 
ttacttagcc 
gcgagtgact 
tccgccatta 
ttccggtcgt 
aggcgggggg 
gtcctgatat 
gctgggacgg 



gtcatgttag 
cctccagcga 
gttaacgtac 
gtctatatgc 
taatcaagta 
accgactggc 
ttgcggttat 
gaaccgtcat 
atttaccggg 
catgtagatg 
acccgcacct 
accctcaaac 
gggtaactgc 
gaccgattga 
cctctgggtt 
ggtcacacca 
ggcaaatttg 
cggggagggg 
ttttactcct 
accccgtcct 
acccgagata 
gcgggacatc 
gtgaacggtc 
agcggccgaa 
gaaatgccgt 
gcgggactat 
agaacaaggt 
cctaaaaccc 
gcttaattaa 
gtccgtcttc 
gtccgagggg 
agggcgggga 
gggtaccgac 
ataaggtctt 
ctcgaacata 
catatagccg 
acggcaaggc 
cgagcccaag 
ctgggacaag 
cacccacgcg 
ggccctgcgg 
gcgggacgcg 
gcacgatgct 
aaaggccctg 
ggtggggttg 
aaagtgttta 
acatagaata 
tatcgacaaa 
ttcgtatttc 
cgcgagtgac 
ggttgcgcgc 
gagcgacgcg 
tgccaatagg 
tttccggtcc 
actgctcgta 
ttctatggtc 
cgaatggcct 



acgagactac 
ctcatcacgc 
ttcttagacg 
gcaactgtaa 
tcgggtatat 
gggttgctgg 
ccctgaaagg 
gtagttcaca 
cggaccgtaa 
cataatcagt 
atcgccaaac 
aaaaccgtgg 
gtttacccgc 
tctcttgggt 
cgaccgatcg 
ccttaagacg 
ggcgactagt 
gcacggaagg 
ttaacgtagc 
gtcgttcccc 
ccgaagactc 
gccgcgtaat 
gcgggatcgc 
aggggcagtt 
ggagctgggg 
ctgccaaaaa 
ttgaccttgt 
ctaaagccgg 
gacaccttac 
atacgtttcg 
tcgtccgtct 
ttgaggcggg 
tgattaaaaa 
catcactcct 
taggtaaaag 
tatcatatta 
cacgagtggc 
agggccctga 
tagtcgcgcc 
ccggacctgc 
aggcccggcc 
ctgggccggc 
ctaaagctaa 
cggccgacct 
aacaaataac 
tttcgtaaaa 
gtacagacat 
ggacacactt 
acatttcgga 
gggcgaaagg 
ccctctccgc 
agccagcaag 
tgtcttagtc 
ttggcatttt 
gtgtttttag 
cgcaaagggg 
atggacaggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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ggaaagaggg 
gccacatcca 
cgacgcggaa 
gtgaccgtcg 
tcaagaactt 
gagacgactt 
ggtggcgacc 
ctagagttct 
gtgcaattcc 
taatttttac 
tggttacgaa 
aacggactga 
cacgacgtta 
tcggtcggcc 
gataattaac 
aacaacggta 
cgaggccaag 
aatcgaggaa 
accaataccg 
actgaccact 
gaacgggccg 
agtaaccttt 
caagctacat 
aaagacccac 
cctttacaac 
taacagagta 
gcgcgtgtaa 



aagcccttcg 
gcaagcgagg 
taggccattg 
tcggtgacca 
caccaccgga 
cggtcaatgg 
atcgccacca 
tctaggaaac 
ctaaaaccag 
ttcaaaattt 
ttagtcactc 
ggggcagcac 
ctatggcgct 
ttcccggctc 
aacggccctt 
acgatgtccg 
ggttgctagt 
gccaggaggc 
tcgtgacgta 
catgagttgg 
cagttatgcc 
tgcaagaagc 
tgggtgagca 
tcgtttttgt 
ttatgagtat 
ctcgcctatg 
agggerctttt 



caccgcgaaa 
ttcgacccga 
atagcagaac 
ttgtcctaat 
ttgatgccga 
aagccttttt 
aaaaaacaaa 
tagaaaagat 
tactctaata 
agttagattt 
cgtggataga 
atctattgat 
ctgggtgcga 
gcgtcttcac 
cgatctcatt 
tagcaccaca 
tccgctcaat 
tagcaacagt 
ttaagagaat 
ttcagtaaga 
ctattatggc 
cccgcttttg 
cgtgggttga 
ccttccgttt 
gagaaggaaa 
tataaactta 
cacggtggac 



gagttacgag 
cacacgtgct 
tcaggttggg 
cgtctcgctc 
tgtgatcttc 
ctcaaccatc 
cgttcgtcgt 
gccccagact 
gtttttccta 
catatatact 
gtcgctagac 
gctatgccct 
gtggccgagg 
caggacgttg 
catcaagcgg 
gtgcgagcag 
gtactagggg 
cttcattcaa 
gacagtacgg 
ctcttatcac 
gcggtgtatc 
agagttccta 
ctagaagtcg 
tacggcgttt 
aagttataat 
cataaatctt 
tgcag 



tgcgacatcc 
tggggggcaa 
ccattctgtg 
catacatccg 
ctgtcataaa 
gagaactagg 
ctaatgcgcg 
gcgagtcacc 
gaagtggatc 
catttgaacc 
agataaagca 
cccgaatggt 
tctaaatagt 
aaataggcgg 
tcaattatca 
caaaccatac 
gtacaacacg 
ccggcgtcac 
taggcattct 
atacgccgct 
gtcttgaaat 
gaatggcgac 
tagaaaatga 
tttcccttat 
aacttcgtaa 
tttatttgtt 



atagagtcaa 
gtcgggctgg 
ctgaatagcg 
ccacgatgtc 
ccatagacgc 
ccgtttgttt 
tctttttttc 
ttgcttttga 
taggaaaatt 
agactgtcaa 
agtaggtatc 
agaccggggt 
cgttatttgg 
aggtaggtca 
aacgcgttgc 
cgaagtaagt 
ttttttcgcc 
aatagtgagt 
acgaaaagac 
ggctcaacga 
tttcacgagt 
aactctaggt 
aagtggtcgc 
tcccgctgtg 
atagtcccaa 
tatccccaag 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5015 



<210> 29 
<211> 2352 
<212> DNA 

<213> fusion protein human 



<400> 29 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 



gctggaggta 
ctgtggcaga 
a0?9accaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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tctttgaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagcctg 1620 

gacgatctcc agccgtggca ttcttttggg gctgactctg tgccagccaa cacagaaaac 1680 

gaagttgagc ctgttgatgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 

tctgggttga caaatatcaa gacggaggag atctctgaag tgaactttga agtggaattc 1800 

cgacatgact caggatatga agttcatcat caaaaattgg tgttctttgc agaagatgtg 1860 

ggttcaaaca aaggtgcaat cattggactc atggtgggcg gtgttgtcat agcgacagtg 1920 

atcgtcatca ccttggtgat gctgaagaag aaaaagcttg gtaccgagct cggatccact 1980 

agtccagtgt ggtggaattc tgcagatatc ctggagccag tagatcctag actagagccc 2040 

tggaagcatc caggaagtca gcctaaaact gcttgtacca attgctattg taaaaagtgt 2100 

tgctttcatt gccaagtttg tttcatgaca aaagccttag gcatctccta tggcaggaag 2160 

aagcggagac agcgacgaag agctcatcag aacagtcaga ctcatcaagc ttctctatca 2220 

aagcagagga tatccagcac agtggcggcc gcagacgccg ctgtcacccc agaggagcgc 2280 

cacctgtcca agatgcagca gaacggctac gaaaatccaa cctacaagtt ctttgagcag 2340 

atgcagaact ag 2352 

<210> 30 
<211> 783 
<212> PRT 

<213> fusion protein - hxunan 

<400> 30 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 

50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe He Ser Asp Ala Leu Leu 

115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 

130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 

180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 

195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 

210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 

260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 

290 295 300 

Glu Thr Pro Gly T^P Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
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325 



330 



335 



Glu 


Trp 


Glu 


Glu Ala Glu Arg Gin 


Ala Lys 


Asn 


Leu 


Pro 


Lys 


Ala 


Asp 








340 


345 








350 






Lys 


Lys 


Ala 


Val He Gin His Phe 


Gin Glu 


Lys 


Val 


Glu 


Ser 


Leu 


Glu 






355 


360 








365 








Gin 


Glu 


Ala 


Ala Asn Glu Arg Gin 


Gin Leu 


Val 


Glu 


Thr 


His 


Met 


Ala 




370 




375 






380 










Arg 


Val 


Glu 


Ala Met Leu Asn Asp 


Arg Arg 


Arg 


Leu 


Ala 


Leu 


Glu 


Asn 


385 






390 




395 










400 


Tyr 


He 


Thr 


Ala Leu Gin Ala Val 


Pro Pro 


Arg 


Pro 


Arg 


His 


Val 


Phe 








405 


410 










415 




Asn 


Met 


Leu 


Lys Lys Tyr Val Arg 


Ala Glu 


Gin 


Lys 


Asp 


Arg 


Gin 


His 








420 


425 








430 






Ihr 


Leu 


Lys 


His Phe Glu His Val 


Arg Met 


Val 


Asp 


Pro 


Lys 


Lys 


Ala 






435 


440 








445 








Ala 


Gin 


He 


Arg Ser Gin Val Met 


Thr His 


Leu 


Arg 


Val 


He 


Tyr 


Glu 




450 




455 






460 










Arg 


Met 


Asn 


Gin Ser Leu Ser Leu 


Leu Tyr 


Asn 


Val 


Pro 


Ala 


Val 


Ala 


465 






470 


475 










480 


Glu 


Glu 


He 


Gin Asp Glu Val Asp 


Glu Leu 


Leu 


Gin 


Lys 


Glu 


Gin 


Asn 








485 


490 










495 




Tyr 


Ser 


Asp 


Asp Val Leu Ala Asn 


Met He 


Ser 


Glu 


Pro 


Arg 


He 


Ser 








500 


505 








510 






Tyr 


Gly 


Asn 


Asp Ala Leu Met Pro 


Ser Leu 


Thr 


Glu 


Thr 


Lys 


Thr 


Thr 






515 


520 








525 








Val 


Glu 


Leu 


Leu Pro Val Asn Gly 


Glu Phe 


Ser 


Leu 


Asp 


Asp 


Leu 


Gin 




530 




535 






540 










Pro 


Trp 


His 


Ser Phe Gly Ala Asp 


Ser Val 


Pro 


Ala 


Asn 


Thr 


Glu 


Asn 


545 






550 




555 










560 


Glu 


Val 


Glu 


Pro Val Asp Ala Arg 


Pro Ala 


Ala 


Asp 


Arg 


Gly 


Leu 


Thr 








565 


570 










575 




Thr 


Arg 


Pro 


Gly Ser Gly Leu Thr 


Asn He 


Lys 


Thr 


Glu 


Glu 


He 


Ser 








580 


585 








590 






Glu 


Val 


Asn 


Phe Glu Val Glu Phe 


Arg His 


Asp 


Ser 


Gly 


Tyr 


Glu 


Val 






595 


600 








605 








His 


His 


Gin 


Lys Leu Val Phe Phe 


Ala Glu 


Asp 


Val 


Gly 


Ser 


Asn 


Lys 




610 




615 






620 










Gly 


Ala 


He 


He Gly Leu Met Val 


Gly Gly 


Val 


Val 


He 


Ala 


Thr 


Val 


625 






630 


635 










640 


He 


Val 


He 


Thr Leu Val Met Leu 


Lys Lys 


Lys 


Lys 


Leu 


Gly 


Thr 


Glu 








645 


650 








655 




Leu 


Gly 


Ser 


Thr Ser Pro Val Trp 


Trp Asn 


Ser 


Ala 


Asp 


He 


Leu 


Glu 






660 


665 








670 






Pro 


Val 


Asp 


Pro Arg Leu Glu Pro 


Trp Lys 


His 


Pro 


Gly 


Ser 


Gin 


Pro 






675 


680 








685 








Lys 


Thr 


Ala 


Cys Thr Asn Cys Tyr 


Cys Lys 


Lys 


Cys 


Cys 


Phe 


His 


Cys 




690 




695 






700 










Gin 


Val 


Cys 


Phe Met Thr Lys Ala 


Leu Gly 


He 


Ser 


Tyr 


Gly 


Arg 


Lys 


705 






710 




715 










720 


Lys 


Arg 


Arg 


Gin Arg Arg Arg Ala 


His Gin 


Asn 


Ser 


Gin 


Thr 


His 


Gin 








725 


730 










735 




Ala 


Ser 


Leu 


Ser Lys Gin Arg He 


Ser Ser 


Thr 


Val 


Ala 


Ala 


Ala 


Asp 








740 


745 








750 






Ala 


Ala 


Val 


Thr Pro Glu Glu Arg 


His Leu 


Ser 


Lys 


Met 


Gin 


Gin 


Asn 






755 


760 






765 








Gly 


Tyr 


Glu 


Asn Pro Thr Tyr Lys 


Phe Phe 


Glu 


Gin 


Met 


Gin 


Asn 






770 




775 






780 











<210> 31 
<211> 2823 
<212> DNA 
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<213> fusion protein - human 



<400> 31 

atgctgcccg 

cccactgatg 

ctgaacatgc 

acctgcattg 

cagatcacca 

ggccgcaagc 

gagtttataa 

atggatgttt 

aagagtacca 

ggggtagagt 

gcggaggagg 

agtgaagaca 

gaagccgatg 

ccctacgaag 

gagtctgtgg 

gacaagtatc 

gagaggcttg 

gcagaacgtc 

caggagaaag 

acacacatgg 

tacatcaccg 

aagtatgtcc 

cgcatggtgg 

gtgatttatg 

gaggagattc 

gtcttggcca 

tctttgaccg 

gacgatctcc 

gaagttgagc 

tctgggttga 

cgacatgact 

ggttcaaaca 

atcgtcatca 

agtccagtgt 

gatatttgcc 

ctgaagaaca 

gcacatctga 

tttcctcgag 

ttgttaacag 

gcttcagtgg 

tcatcggaag 

gatctggccc 

gtggcgatgg 

gattccccgg 

gccgacttcg 

gatatccagc 

aagatgcagc 

tag 



gtttggcact 
gtaatgctgg 
acatgaatgt 
ataccaagga 
atgtggtaga 
agtgcaagac 
gtgatgccct 
gcgaaactca 
acttgcatga 
ttgtgtgttg 
atgactcgga 
aagtagtaga 
atgacgagga 
aagccacaga 
aagaggtggt 
tcgagacacc 
aggccaagca 
aagcaaagaa 
tggaatcttt 
ccagagtgga 
ctctgcaggc 
gcgcagaaca 
atcccaagaa 
agcgcatgaa 
aggatgaagt 
acatgattag 
aaacgaaaac 
agccgtggca 
ctgttgatgc 
caaatatcaa 
caggatatga 
aaggtgcaat 
ccttggtgat 
ggtggaattc 
gacttaaaaa 
actgggagtg 
cagaagtgga 
aagaccttga 
gattatttgt 
agactgatat 
agagtagtaa 
ccccgaccga 
cgcatgccga 
gtccgggatt 
agtttgagca 
acagtggcgg 
agaacggcta 



gctcctgctg 
cctgctggct 
ccagaatggg 
aggcatcctg 
agccaaccaa 
ccatccccac 
tctcgttcct 
tcttcactgg 
ctacggcatg 
cccactggct 
tgtctggtgg 
agtagcagag 
cgatgaggat 
gagaaccacc 
tcgagttcct 
tggggatgag 
ccgagagaga 
cttgcctaaa 
ggaacaggaa 
agccatgctc 
tgttcctcct 
gaaggacaga 
agccgctcag 
tcagtctctc 
tgatgagctg 
tgaaccaagg 
caccgtggag 
ttcttttggg 
ccgccctgct 
gacggaggag 
agttcatcat 
cattggactc 
gctgaagaag 
tgcagatatc 
gctcaagtgc 
tcgctactct 
atcaaggcta 
catgattttg 
acaagataat 
gcctctaaca 
caaaggtcaa 
tgtcagcctg 
cgcgctagac 
taccccccac 
gatgtttacc 
ccgcgacgcc 
cgaaaatcca 



gccgcctgga 
gaaccccaga 
aagtgggatt 
cagtattgcc 
ccagtgacca 
tttgtgattc 
gacaagtgca 
cacaccgtcg 
ttgctgccct 
gaagaaagtg 
ggcggagcag 
gaggaagaag 
ggtgatgagg 
agcattgcca 
acaacagcag 
aatgaacatg 
atgtcccagg 
gctgataaga 
gcagccaacg 
aatgaccgcc 
cggcctcgtc 
cagcacaccc 
atccggtccc 
tccctgctct 
cttcagaaag 
atcagttacg 
ctccttcccg 
gctgactctg 
gccgaccgag 
atctctgaag 
caaaaattgg 
atggtgggcg 
aaaaagcttg 
aagctactgt 
tccaaagaaa 
cccaaaacca 
gaaagactgg 
aaaatggatt 
gtgaataaag 
ttgagacagc 
agacagttga 
ggggacgagc 
gatttcgatc 
gactccgccc 
gatgcccttg 
gctgtcaccc 
acctacaagt 



cggctcgggc 
ttgccatgtt 
cagatccatc 
aagaagtcta 
tccagaactg 
cctaccgctg 
aattcttaca 
ccaaagagac 
gcggaattga 
acaatgtgga 
acacagacta 
tggctgaggt 
tagaggaaga 
ccaccaccac 
ccagtacccc 
cccatttcca 
tcatgagaga 
aggcagttat 
agagacagca 
gccgcctggc 
acgtgttcaa 
taaagcattt 
aggttatgac 
acaacgtgcc 
agcaaaacta 
gaaacgatgc 
tgaatggaga 
tgccagccaa 
gactgaccac 
tgaactttga 
tgttctttgc 
gtgttgtcat 
gtaccgagct 
cttctatcga 
aaccgaagtg 
aaaggtctcc 
aacagctatt 
ctttacagga 
atgccgtcac 
atagaataag 
ctgtatcggg 
tccacttaga 
tggacatgtt 
cctacggcgc 
gaattgacga 
cagaggagcg 
tctttgagca 



gctggaggta 
ctgtggcaga 
agggaccaaa 
ccctgaactg 
gtgcaagcgg 
cttagttggt 
ccaggagagg 
atgcagtgag 
caagttccga 
ttctgctgat 
tgcagatggg 
ggaagaagaa 
ggctgaggaa 
caccaccaca 
tgatgccgtt 
gaaagccaaa 
atgggaagag 
ccagcatttc 
gctggtggag 
cctggagaac 
tatgctaaag 
cgagcatgtg 
acacctccgt 
tgcagtggcc 
ttcagatgac 
tctcatgcca 
gttcagcctg 
cacagaaaac 
tcgaccaggt 
agtggaattc 
agaagatgtg 
agcgacagtg 
cggatccact 
acaagcatgc 
cgccaagtgt 
gctgactagg 
tctactgatt 
tataaaagca 
agatagattg 
tgcgacatca 
aattcccggg 
cggcgaggac 
gggggacggg 
tctggatatg 
gtacggtggg 
ccacctgtcc 
gatgcagaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2823 



<210> 32 
<211> 941 
<212> PRT 

<213> fusion protein - human 
<400> 32 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 
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Gin 


He 


Ala 


Met 


Phe 


Cys Gly 


Arg Leu Asn 






35 








40 


Asn 


Gly 


Lys 


Trp 


Asp 


Ser Asp 


Pro Ser Gly 




50 








55 




Thr 


Lys 


Glu 


Gly 


He 


Leu Gin 


Tyr Cys Gin 


65 










70 




Gin 


He 


Thr 


Asn 


Val 


Val Glu 


Ala Asn Gin 










85 




90 


Trp 


Cys 


Lys 


Arg 


Gly 


Arg Lys 


Gin Cys Lys 








100 






105 


lie 


Pro 


Tyr 


Arg 


Cys 


Leu Val 


Gly Glu Phe 






115 








120 


Val 


Pro 


Asp 


Lys 


Cys 


Lys Phe 


Leu His Gin 




130 








135 




Glu 


Thr 


His 


Leu 


His 


Trp His 


Thr Val Ala 


145 










150 




hys 


Ser 


Thr 


Asn 


Leu 


His Asp 


Tyr Gly Met 








165 




170 


Asp 


Lys 


Phe 


Arg 


Gly 


Val Glu 


Phe Val Cys 








180 






185 


Ser 


Asp 


Asn 


Val 


Asp 


Ser Ala 


Asp Ala Glu 




195 








200 


Tarp 


Trp 


Gly 


Gly 


Ala 


Asp Thr 


Asp Tyr Ala 




210 








215 




Val 


Val 


Glu 


Val 


Ala 


Glu Glu 


Glu Glu Val 


225 










230 




Glu 


Ala 


Asp 


Asp 


Asp 


Glu Asp 


Asp Glu Asp 










245 




250 


Glu 


Ala 


Glu 


Glu 


Pro 


Tyr Glu 


Glu Ala Thr 








260 




265 


Ala 


Thr 


Thr 


Thr 


Thr 


Thr Thr 


Thr Glu Ser 






275 








280 


Val 


Pro 


Thr 


Thr 


Ala 


Ala Ser 


Thr Pro Asp 




290 








295 




Glu 


Thr 


Pro 


Gly 


Asp 


Glu Asn 


Glu His Ala 


305 










310 




Glu 


Arg 


Leu 


Glu 


Ala 


Lys His 


Arg Glu Arg 










325 




330 


Glu 


Trp 


Glu 


Glu 


Ala 


Glu Arg 


Gin Ala Lys 








340 






345 


Lys 


Lys 


Ala 


Val 


He 


Gin His 


Phe Gin Glu 




355 








360 


Gin 


Glu 


Ala 


Ala 


Asn 


Glu Arg 


Gin Gin Leu 




370 








375 




Arg 


Val 


Glu 


Ala 


Met 


Leu Asn 


Asp Arg Arg 


385 










390 




Tyr 


He 


Thr 


Ala 


Leu 


Gin Ala 


Val Pro Pro 








405 




410 


Asn 


Met 


Leu 


Lys 


Lys 


Tyr Val 


Arg Ala Glu 








420 






425 


Thr 


Leu 


Lys 


His 


Phe 


Glu His 


Val Arg Met 






435 








440 


Ala 


Gin 


He 


Arg 


Ser 


Gin Val 


Met Thr His 




450 






455 




Arg 


Met 


Asn 


Gin 


Ser 


Leu Ser 


Leu Leu Tyr 


465 










470 




Glu 


Glu 


He 


Gin 


Asp 


Glu Val 


Asp Glu Leu 










485 




490 


Tyr 


Ser 


Asp 


Asp 


Val 


Leu Ala 


Asn Met He 






500 






505 


Tyr 


Gly 


Asn 


Asp 


Ala 


Leu Met 


Pro Ser Leu 




515 






520 



PCT/US03/0S4S8 



Met His 


Met 


Asn 


Val 


Gin 




45 








Thr Lys 


Thr 


Cys 


He 


Asp 


60 










Glu Val 


Tyr 


Pro 


Glu 


Leu 


75 








80 


Pro Val 


Thr 


He 


Gin 


Asn 








95 




Thr His 


Pro 


His 


Phe 


Val 






110 






He Ser 


Asp 


Ala 


Leu 


Leu 




125 








Glu Arg 


Met 


Asp 


Val 


Cys 


140 










Lys Glu 


Thr 


Cys 


Ser 


Glu 


155 








160 


Leu Leu 


Pro 


Cys 


Gly 


He 








175 




Cys Pro 


Leu 


Ala 


Glu 


Glu 




190 






Glu Asp 


Asp 


Ser 


Asp 


Val 




205 








Asp Gly 


Ser 


Glu 


Asp 


Lys 


220 










Ala Glu 


Val 


Glu 


Glu 


Glu 


235 








240 


Gly Asp 


Glu 


Val 


Glu 


Glu 






255 




Glu Arg 


Thr 


Thr 


Ser 


He 




270 






Val Glu 


Glu 


val 


Val 


Arg 




285 








Ala Val 


Asp 


Lys 


Tyr 


Leu 


300 










His Phe 


Gin 


Lys 


Ala 


Lys 


315 








320 


Met Ser 


Gin 


Val 


Met 


Arg 








335 




Asn Leu 


Pro 


Lys 


Ala 


Asp 






350 






Lys Val 


Glu 


Ser 


Leu 


Glu 


365 








Val Glu 


Thr 


His 


Met 


Ala 


380 










Arg Leu 


Ala 


Leu 


Glu 


Asn 


395 








400 


Arg Pro 


Arg 


His 


Val 


Phe 








415 




Gin Lys 


Asp 


Arg 


Gin 


His 




430 






Val Asp 


Pro 


Lys 


Lys 


Ala 




445 








Leu Arg 


Val 


He 


Tyr 


Glu 


460 










Asn Val 


Pro 


Ala 


Val 


Ala 


475 








480 


Leu Gin 


Lys 


Glu 


Gin 


Asn 






495 




Ser Glu 


Pro 


Arg 


He 


Ser 






510 






Thr Glu 


Thr 


Lys 


Thr 


Thr 



525 
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Val 


Glu 


Leu Leu 


Pro Val Asn 


Gly Glu 




530 




535 




Pro 


Trp 


His Ser 


Phe Gly Ala 


Asp Ser 


545 






550 




Glu 


Val 


Glu Pro 


Val Asp Ala 


Arg Pro 








565 




Thr 


Arg 


Pro Gly 


Ser Gly Leu 


Thr Asn 






580 




585 


Glu 


Val 


Asn Phe 


Glu Val Glu 


Phe Arg 






595 




600 


His 


His 


Gin Lys 


Leu Val Phe 


Phe Ala 




610 




615 




Gly 


Ala 


He He 


Gly Leu Met 


Val Gly 


625 






630 




He 


Val 


He Thr 


Leu Val Met 


Leu Lys 








645 




Leu 


Gly 


Ser Thr 


Ser Pro Val 


Trp Trp 






660 




665 


Leu 


Ser 


Ser He 


Glu Gin Ala 


Cys Asp 






675 




680 


Lys 


Cys 


Ser Lys 


Glu Lys Pro 


Lys Cys 




690 




695 




Trp 


Glu 


Cys Arg 


Tyr Ser Pro 


Lys Thr 


705 






710 




Ala 


His 


Leu Thr 


Glu Val Glu 


Ser Arg 








725 




Phe 


Leu 


Leu He 


Phe Pro Arg 


Glu Asp 






740 




745 


Asp 


Ser 


Leu Gin 


Asp He Lys 


Ala Leu 






755 




760 


Asp 


Asn 


Val Asn 


Lys Asp Ala 


val Thr 




770 




775 




Thr 


Asp 


Met Pro 


Leu Thr Leu 


Arg Gin 


785 




790 




Ser 


Ser 


Glu Glu 


Ser Ser Asn 


Lys Gly 








805 




Gly 


He 


Pro Gly 


Asp Leu Ala 


Pro Pro 




820 




825 


Glu 


Leu 


His Leu 


Asp Gly Glu 


Asp Val 






835 




840 


Leu 


Asp 


Asp Phe 


Asp Leu Asp 


Met Leu 




850 




855 




Pro 


Gly 


Phe Thr 


Pro His Asp 


Ser Ala 


865 






870 




Ala 


Asp 


Phe Glu 


Phe Glu Gin 


Met Phe 








885 




Glu 


Tyr 


Gly Gly 


Asp He Gin 


His Ser 






900 




905 


Val 


Thr 


Pro Glu 


Glu Arg His 


Leu Ser 






915 


920 


Glu 


Asn 


Pro Thr 


Tyr Lys Phe 


Phe Glu 




930 




935 





<210> 33 
<211> 63 
<212> PRT 
<213> human 

<400> 33 

Glu Glu He Ser Glu Val Lys Met Asp 
1 5 
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Phe Ser Leu 


Asp 


Asp 


Leu 


Gin 


540 










Val Pro Ala 


Asn 


Thr 


Glu 


Asn 


555 








560 


Ala Ala Asp 


Arg 


Gly 


Leu 


Thr 


570 






575 




He Lys Thr 


Glu 


Glu 


He 


Ser 




590 






His Asp Ser 


Gly 


Tyr 


Glu 


Val 




605 








Glu Asp Val 


Gly 


Ser 


Asn 


Lys 


620 










Gly Val Val 


He 


Ala 


Thr 


Val 


635 








640 


Lys Lys Lys 


Leu 


Gly 


Thr 


Glu 


650 






655 




Asn Ser Ala 


Asp 


He 


Lys 


Leu 






670 






He Cys Arg 


Leu 


Lys 


Lys 


Leu 




685 








Ala Lys Cys 


Leu 


Lys 


Asn 


Asn 


700 










Lys Arg Ser 


Pro 


Leu 


Thr 


Arg 


715 








720 


Leu Glu Arg 


Leu 


Glu 


Gin 


Leu 


730 






735 




Leu Asp Met 


He 


Leu 


Lys 


Met 




750 






Leu Thr Gly 


Leu 


Phe 


Val 


Gin 




765 








Asp Arg Leu 


Ala 


Ser 


Val 


Glu 


780 










His Arg He 


Ser 


Ala 


Thr 


Ser 


795 








800 


Gin Arg Gin 


Leu 


Thr 


Val 


Ser 


810 






815 




Thr Asp Val 


Ser 


Leu 


Gly 


Asp 






830 






Ala Met Ala 


His 


Ala 


Asp 


Ala 




845 








Gly Asp Gly 


Asp 


Ser 


Pro 


Gly 


860 










Pro Tyr Gly 


Ala 


Leu 


Asp 


Met 


875 








880 


Thr Asp Ala 


Leu 


Gly 


He 


Asp 


890 






895 




Gly Ala Ala 


Ala 


Asp 


Ala 


Ala 




910 






Lys Met Gin 


Gin 


Asn 


Gly 


Tyr 




925 








Gin Met Gin 


Asn 








940 










Ala Glu Phe 
1 ft 


Arg 


His 


Asp 
1 e 


Ser 
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Gly Tyr Glu Val His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala lie He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 34 
<211> 4 
<212> PRT 
<213> human 

<400> 34 
Lys Met Asp Ala 
1 



<210> 35 
<211> 7 
<212> PRT 
<213> human 

<400> 35 

Gly Val Val He Ala Thr Val 
1 5 



<210> 36 
<211> 63 
<212> PRT 
<213> human 

<400> 36 

Glu Glu He Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 37 
<211> 63 
<212> PRT 
<213> human 

<400> 37 

Glu Glu He Ser Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 

35 40 45 

He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 
50 55 60 



<210> 38 
<211> 4 
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<212> PRT 
<213> human 

<400> 38 
Asn Leu Asp Ala 
1 



<210> 39 
<211> 63 
<212> PRT 
<213> huxoan 

<400> 39 

Glu Glu He Ser Glu Val Asn Phe Glu Val Glu Phe Arg His Asp Ser 

15 10 15 

Gly Tyr Glu Val His His Gin Val Leu Val Phe Phe Ala Glu Asp Val 

20 25 30 

Gly Ser Asn Lys Gly Ala He He Gly Leu Met Val Gly Gly Val Val 



He Ala Thr Val He Val He Thr Leu Val Met Leu Lys Lys Lys 



<210> 40 
<211> 4 
<212> PRT 
<213> human 

<400> 40 
Asn Phe Glu Val 
1 



<210> 41 
<211> 5 
<212> PRT 
<213> hioman 

<400> 41 

Val Asn Phe Ala Val 
1 5 



<210> 42 
<211> 5 
<212> PRT 
<213> h\2inan 

<400> 42 

Val Lys Val Asp Ala 
1 5 



<210> 43 
<211> 5 
<212> PRT 
<213> hxunan 

<400> 43 

Trp Lys Met Asp Ala 
1 5 



35 



40 



45 



50 



55 



60 
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<210> 44 
<211> 5 
<212> PRT 
<213> human 

<400> 44 

Val Lys Ala Asp Ala 
1 5 



<210> 45 
<211> 5 
<212> PRT 
<213> human 

<400> 45 

Val Lys Lys Asp Ala 
1 5 



<210> 46 
<211> 5 
<212> PRT 
<213> hmnan 

<400> 46 

Val Lys Glu Asp Ala 
1 5 



<210> 47 
<211> 5 
<212> PRT 
<213> human 

<400> 47 

Val Lys He Asp Ala 
1 5 



<210> 48 
<211> 5 
<212> PRT 
<213> human 

<400> 48 ' 
Val Lys Met He Ala 
1 5 



<210> 49 
<211> 5 
<212> PRT 
<213> hiunan 

<400> 49 

Val Lys Met Asn Ala 
1 5 
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<210> 50 
<211> 5 
<212> PRT 
<213> human 

<400> 50 

Val Lys Met Glu Ala 
1 5 



<210> 51 
<211> 5 
<212> PRT 
<213> human 

<400> 51 

Val Lys Met Asp Glu 
1 5 



<210> 52 
<211> 5 
<212> PRT 
<213> human 

<400> 52 

Val Lys Met Asp Lys 
1 5 



<210> 53 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 53 

ggagaggata tcatggagcc agtagat 

<210> 54 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 54 

tacatggcgg ccgcctactt actgctttg 

<210> 55 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PGR primer 
<400> 55 

ggatgtgata tctttcttct tcagcatcac caagg 
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<210> 56 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 56 

ctgagatatc aagctactgt cttctatcga acaagc 36 

<210> 57 
<211> 31 
<212> m^A 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 57 

gcgcgatatc cccaccgtac tcgtcaattc c 31 

<210> 58 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 58 

ctgctgtggc ggccgcctag ttctgcatct gctc 34 

<210> 59 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 59 

tgccccgcgc ggccgcgcga tgctgcccgg 30 

<210> 60 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 60 

atggtgtggc ggccgcagac gccgctgtca cc 32 

<210> 61 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> PGR Primer 



<400> 61 

agcgcacaag cttccccgcg cagggtcgcg atgctg 



36 



<210> 62 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 62 

ggatgtaagc tttttcttct tcagcatcac caagg 35 

<210> 63 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 63 

tgcagatatc ctggagccag tagatcctag ac 32 

<210> 64 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer 
<400> 64 

gctggatatc ctctgctttg atagagaagc 30 
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